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TRANSMITTAL LETTER TO THE UNITED StJ^S ^/ATTORNErS DOCKET NUMBER 51241 

DESIGNATED/ELECTED OFFICE (DO/EO/US) X^TRaO^^ .^-^'l^^&v 
CONCERNING A FILING UNDER 35 U.S.C. 371 CTs. APPLICATIo|Ng (l^krgvn^^ ^) 

INTERNATIONAL APPLICATION NO. INTERNATIONAL FILING DATE PRIORITY DATE CLAIMED 
PCT/EPOO/07252 27 July 2000 10 March 2000 

27 July 1999 

TITLE OF INVENTION: MODIFIED CYTOCHROME P450 MONOOXYGENASES 

APPLICANT(S) FOR DO/EO/US Bemhard HAUER. Juergen PLEISS, Ulrich SCHWANEBERG, Jutta SCHMITT 

Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following 
items and other information: 

1 . /X/ This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

2. / / This is a SECOND or SUBSEQUENT submission of Items concerning a filing under 35 U.S.C. 371 . 

3. /X/ This express request to begin national examination procedures (35 U.S.C.371 (f)) at any time rather than delay examination until 

the expiration of the applicable time limit set in 35 U.S.C. 371 (b) and POT Articles 22 and 39(1). 

4. /k / A proper Demand for International Preliminary Examination was made by the 19th month from the earliest claimed priority date. 

5. /X/ A copy of the International Application as filed (35 U.S.C. 371 (c)(2)). 

a. /X/ is transmitted herewith (required only if not transmitted by the Intemational Bureau). 

b. / / has been transmitted by the International Bureau, 

c. / / Is not required, as the application was filed in the United States Receiving Office (RO/USO). 

6. /X/ A translation of the International Application into English (35 U.S.C. 371 (c)(2)). 

" 7. / / Amendments to the claims of the Intemational Application under PCT Article 1 9 (35 U.S.C. 371 (c)(3)). 

a. / / are transmitted herewith (required only if not transmitted by the International Bureau). 

b. / / have been transmitted by the International Bureau, 

c. / / have not been made; however, the time limit for making such amendments has NOT expired. 
6 J I have not been made and will not be made. 

8. / / A translation of the amendments to the claims under PCT Article 1 9(35 U.S.C. 371 (c)(3)). 

9. /X / An oath or declaration of the inventor(s)(35 U.S.C. 171(c)(4)). 

10. // A translation of the annexes to the Intemational Preliminary Examination Report under PCT Article 36 (35 U.S.C. 371 (c)(5)). 
Items 1 1 . to 1 6. below concern other document(s) or information Included: 

11. // An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98. 

12. / X / An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 

1 3i X / A FIRST preliminary amendment. 
il A SECOND or SUBSEQUENT preliminary amendment. 

14. // A substitute specification. 

15. // A change of power of attorney and/or address letter. 

1 6. /)( / Other Items or information. 

International Search Report 

Intemational Preliminary Examination Report 



U.S. Alpli 



TIONALAPPLN. NO. 
10/07252 



ATTORNEYS DOCKET NO. 
51241 



17. /X/ The following fees are submitted 

BASIC NATIONAL FEE (37 CFR 1 .492(a)(1 H5)): 

Search Report has been prepared by the 

EPO or JPO $890.00 

International preliminary examination fee paid to USPTO 
(37 CFR 1.482) $710.00 

No International preliminary examination fee paid to 
USPTO (37 CFR 1 .482) but international search fee paid 
to USPTO (37 CFR 1.445(a)(2)) $740.00 

Neither international preliminary examination fee 

(37 CFR 1 .482) nor International search fee 

(37 CFR 1 .445(a)(2)) paid to USPTO $ 1 .040.00 

International preliminary examination fee paid to 
USPTO (37 CFR 1 .482) and all claims satisfied pro 
-visions of PCT Article 33(2)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS 



890.00 



PTO USE ONLY 



890.00 



Surcharge of $1 30.00 for furnishing the oath or declaration 
later than / / 20 / /30 months from the earliest 
claimed priority date (37 CFR 1 .492(e)). 



Claims 



Number Filed 



Number Extra 



Total Claims 17 -20 

Indep.Claims 1 -3 

Multiple dependent claim(s)(if applicable) 



Rate 

X$18. 
X$84. 
+280. 



TOTAL OF ABOVE CALCULATION 



890. 



Reduction of 1/2 for filing by small entity, if applicable. 
Verified Small Entity statement must also be filed 
(Note 37 CFR 1 .9. 1 .27. 1 .28). 



SUBTOTAL 



Processing fee of $1 30. for furnishing the English 
translation later than / /20 / /30 months from the 
earilest claimed priority date (37 CFR 1 .492(f)). + 



890. 



TOTAL NATIONAL FEE 



890. 



Fee for recording the enclosed assignment (37 CFR 1 .21 (h)). 
The assignment must be accompanied by an appropriate cover 
sheet (37 CFR 3.28. 3.31 ) $40.00 per property 
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TOTAL FEES ENCLOSED 



= $ 930.00 

Amount to be 

refunded: $ 

Charged $ 



a./X/ A check in the amount of $ 930.00 to cover the above fees Is enclosed. 



b.// 



Please charge my Deposit Account No. 
is enclosed. 



Jn the amount of$ 



_ to cover the above fees. A duplicate copy of this sheet 



c./X/ 



The Commissioner Is hereby authorized to charge any additional fees Nwhich may be required, or credit any overpayment to Deposit 
Account No. 11-0345 . A duplicate copy of this sheetis enclosed. 



NOTE: Where an appropriate time limit under 37 CFR 1 .494 or 1 .495 has not been met, a petition to revive (37 CFR 1 .1 37(a) or (b) must 
be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO : 
KEIL& WEINKAUF 
1101 Connecticut Ave., N.W. 
Washington. D. C. 20036 



SIGNATURE 
Herbert B. Keil 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re the Application of 

HAUER et al. ) BOX POT 

International Application 
PCT/EP 00/07252 

Filed: July 27, 2000 

For: MODIFIED CYTOCHROME P450 MONOOXYGENASES I 

i 
I 

PRELIMINARY AMENDMENT 
Honorable Commissioner of ! 

Patents and Trademarks ' 
Washington, D.C. 20231 [ 

Sir: 

Prior to examination, kindly amend the above-identified application as follows: 
IN THE CLAIMS 

Kindly amend the claims as shown on the attached sheets. 

REMARKS 

The claims have been amended to eliminate multiple dependency and to place them in 
better form for U.S. filing. No new matter is included. 
A clean copy of the claims is attached. 
Favorable action is solicited. 

Respectfully submitted. 
KEIL& WEINKAUF 




Herbert B. Keil 
Reg. No. 18,967 



1101 Connecticut Ave., N.W. 
Washington. D.C. 20036 



(202)659-0100 
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CLEA/V VERSION OF AMENDED CLAIMS - OZ 51241 

7. A nucleic acid sequence encoding a monooxygenase as claimed in claim 1 and 

the complementary nucleic acid sequence thereof. 
1 2. (amended) A process for the enzymatic production of terminally or subterminally 

hydroxylated aliphatic carboxylic acids, which comprises 

a1 ) culturing a recombinant microorganism which has been transformed with 
a vector which encompasses an expression construct comprising, under 
the genetic control of regulatory nucleic acid sequences, a sequence 
which encompasses a nucleic acid sequence encoding the 
monooxygenase of claim 1 in the presence of a culture medium which 
contains at least one hydroxylatable carboxylic acid or at least one 
hydroxylatable carboxylic acid derivative; or 

a2) incubating a reaction medium containing at least one hydroxylatable 
carboxylic acid or at least one hydroxylatable carboxylic acid derivative 
with an enzyme as claimed In claim 1 , and 

b) isolating the resulting hydroxylated product from the medium. 
14. A method as claimed in claim 13, wherein the hydroxylatable carboxylic acid is a 

Ca-Cij-monocarboxylic acid or a derivative thereof and the monooxygenase used 

comprises at least one of the following amino acid substitution patterns: 

a) F87V; 

b) F87AL188K; 

c) F87VL188K; 



1 
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CLEAN VERSION OF AMENDED CLAIMS - OZ 51241 

d) F87AL188KA74G; 

e) F87VL188KA74G; 

f) F87A L1 88K A74G R47F; 

g) F87VL188KA74G R47F; 

h) F87A L1 88K A74G R47F V26T; or 

i) F87V L1 88K A74G R47F V26T. 

16. A method as claimed in claim 12, wherein the reaction is carried out in the 
presence of an electron donor or a reduction equivalent. 



MARKED VERSION OF AMENDED CLAIMS - OZ 51241 

7. (amended) A nucleic acid sequence encoding a monooxygenase as claimed in 
[any of the preceding claims] claim 1 and the complementary nucleic acid 
sequence thereof. 

12. (amended) A process for the enzymatic production of terminally or subterminally 
hydroxylated aliphatic carboxylic acids, which comprises 
a1 ) culturing a recombinant microorganism [as claimed in claim 10 or 1 1 
according to the invention] which has been transformed with a vector 
which encompasses an expression construct comprising, under the 
genetic control of regulatory nucleic acid sequences, a sequence which 
encompasses a nucleic acid sequence encoding the monooxygenase of 
claim 1 in the presence of a culture medium which contains at least one 
hydroxylatable carboxylic acid or at least one hydroxylatable carboxylic 
acid derivative; or 

a2) incubating a reaction medium containing at least one hydroxylatable 

carboxylic acid or at least one hydroxylatable carboxylic acid derivative 
with an enzyme as claimed in [any of claims] claim 1 [to 6], and 

b) isolating the resulting hydroxylated product from the medium. 
14. A method as claimed in claim 1 3, wherein the hydroxylatable carboxylic acid is a 

Cs-Cij-monocarboxylic acid or a derivative thereof and the monooxygenase used 

[is a mutant as claimed in claim 5] comprises at least one of the following amino 

acid substitution patterns: 
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MARKED VERSION OF AMENDED CLAIMS - OZ 51241 



a) 


F87V: 


b) 


F87AL188K: 


c} 


F87V L188K: 


dl 


F87AL188KA74G: 


e) 


F87V L188KA74G: 


a 


F87A L188K A74G R47F: 


a) 

h) 

n 


F87V L188K A74G R47F: 


F87A L188K A74G R47F V26T: or 


F87V L188K A74G R47F V26T. 



16. (amended) A method as claimed in [any of claims] claim 12 [to 15], wherein the 
reaction Is carried out in the presence of an electron donor or a reduction 
equivalent. 
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CLAIMS AS FILED - OZ 51241 

1 . A modified cytochrome P450 monooxygenase which, in comparison with the 
wild-type enzyme, shows an altered substrate profile in the terminal and/or 
subterminal enzymatic hydroxylation of aliphatic carboxylic acids, owing to site- 
specific mutagenesis of its substrate binding region. 

2. A monooxygenase as claimed in claim 1 , which is derived from cytochrome P450 
monooxygenases of bacterial origin. 

3. A monooxygenase as claimed in claim 2, which is derived from Bacillus 
megaterium cytochrome P450 monooxygenase BM-3 with an amino acid 
sequence in accordance with SEQ ID NO: 2, which has at least one functional 
mutation in one of the following amino acid sequence regions: 24-28, 45-51, 70- 
72, 73-82. 86-88, 172-224 und 352-356, with the proviso that, if the enzyme 
carries the mutation F87A, more than one of these regions is mutated. 

4. A monooxygenase as claimed in claim 3, which comprises at least one functional 
mutation in the amino acid sequence regions 86-88 and 172-224. 

5. A monooxygenase as claimed in claim 4, which compnses at least one of the 
following amino acid substitution patterns: 

a) F87V; 

b) F87AL188K; 

c) F87VL188K; 

d) F87AL188KA74G; 

e) F87VL188KA74G; I 
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CLAIMS AS FILED - OZ 5Y24f 

f) F87AL188KA74G R47F; | 

g) F87VL188KA74G R47F; , 

h) F87A LI 88K A74G R47F V26T; or 

i) F87V L1 88K A74G R47F V26T; 
and functional equivalents thereof. 

6. A monooxygenase as claimed in claim 3, which comprises a single amino acid 
substitution from amongst the following: 

a) V26T, 

b) R47F, 

c) S72G, 

d) A74G, 

e) F87V. 

f) L188z, where Z is an amino acid selected from amongst K, R, W, Q, N, G. 
A and S. and 

g) M354T; 

and functional equivalents thereof. 

7. A nucleic acid sequence encoding a monooxygenase as claimed in claim 1 and 
the complementary nucleic acid sequence thereof. 

8. An expression construct comprising, under the genetic control of regulatory 
nucleic acid sequences, an encoding sequence which encompasses a nucleic 
acid sequence as claimed in claim 7. 

2 
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CLAIMS AS FILED - OZ 5f 24f 

9. A vector which encompasses at least one expression construct as claimed in 
claim 8. 

10. A recombinant microorganism which has been transformed with at least one 
vector as claimed in claim 9. 

11. A microorganism as claimed in claim 1 0, selected from amongst bacteria of the 
genus Escherichia. 

12. A process for the enzymatic production of terminally or subterminally 
hydroxylated aliphatic carboxylic acids, which comprises 

a1 ) culturing a recombinant microorganism which has been transformed with 
a vector which encompasses an expression construct comprising, under 
the genetic control of regulatory nucleic acid sequences, a sequence 
which encompasses a nucleic acid sequence encoding the 
monooxygenase of claim 1 in the presence of a culture medium which 
contains at least one hydroxylatable carboxylic acid or at least one 
hydroxy latable carboxylic acid derivative; or 

a2) incubating a reaction medium containing at least one hydroxylatable 

carboxylic acid or at least one hydroxylatable carboxylic acid derivative 
with an enzyme as claimed in claim 1 , and 

b) isolating the resulting hydroxylated product from the medium. 

13. A method as claimed in claim 12, wherein the hydroxylatable carboxylic acid is 
a Cg-Cso monocarboxylic acid or a derivative thereof. 

3 



CLAIMS AS FILED - OZ 51241 

14. A method as claimed in claim 13, wherein the hydroxyiatable carboxyiic acid is a 
C8-Ci2-nrionocarboxylic acid or a derivative thereof and the monooxygenase used 
comprises at least one of the following amino acid substitution patterns: 



a) 


F87V; 


b) 


F87A L188K; 


c) 


F87V L188K; 


d) 


F87AL188KA74G; 


e) 


F87V L188KA74G; 


f) 


F87A L188K A74G R47F; 


g) 


F87V L188KA74G R47F; 


h) 


F87A L188K A74G R47F V26T; or 


i) 


F87V L188K A74G R47F V26T. 



15. A method as claimed in claim 13, wherein the hydroxyiatable carboxyiic acid is a 
Ci2-C3o-monocarboxylic acid or a derivative thereof and the monooxygenase 
employed is a mutant selected from amongst the single mutants F87A, F87V, 
V26T, S72G, A74G and M354T, and the multiple mutants 

F87A L1 88K A74G R47F; I 

j 

F87V L1 88K A74G R47F; 

F87A L1 88K A74G R47F V26T; or 

F87V L1 88K A74G R47F V26T. 

16. (amended) A method as claimed in [any of claims] claim 12 [to 1 5], wherein the 
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CLAIMS AS FILED - OZ 51241 \ 

reaction is carried out in the presence of an electron donor or a reduction 
equivalent. 

17. A method as claimed in claim 16, wherein the electron donor or the reduction 
equivalent is selected from amongst NADH, NADPH and zn/CO(l!l) sepulchrate. 
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Modified cytochrome P450 monooxygenases 

The present invention relates to modified cytochrome P450 
5 monooxygenases with an altered substrate profile, to nucleic acid 
sequences coding therefor, to expression constructs and vectors, 
to recombinant microorganisms which comprise these vectors, and 
to processes for the microbiological production of terminally or 
subterminally hydroxylated aliphatic carboxylic acids . 

10 

The monooxygenase with the ncime P450 BM-3 is a cytochrome P450 
enzyme from Bacillus megaterlum and has a pronounced sequence 
homology with mammalian P450 enzymes (1). Owing to these 
correspondences, P4 50 BM-3 constitutes an excellent model system 

15 for this class of P450 enzymes. P450 BM-3 hydroxylates mainly 

long-chain saturated fatty acids on their co-1, (0-2 and (o-3 carbon 
atom. Amides or alcohol analogs and epoxides of long-chain 
unsaturated fatty acids are also converted (1-3). The catalytic 
activity for saturated fatty acids depends on the chain length, 

20 the chain length optimum being 14 to 16 carbon atoms. The enzyme 
shows no catalytic activity for fatty acids with a chain length 
of less than 12 carbon atoms (1). 



Summary of the invention 



It is an object of the present invention to provide cytochrome 
P450 monooxygenase mutants which show a modified substrate 
profile in comparison with the wild-type enzyme. In particular, 
it was an object to provide novel mutants which hydroxylate 

30 saturated aliphatic carboxylic acids in another chain position 
and/or have a modified substrate specificity. In particular, it 
was an object to provide mutants which have a catalytic activity 
for aliphatic carboxylic acids of medium chain length, in 
particular with a chain length of 8 to 12, such as, for example, 

35 8 to 10, carbon atoms, and which hydroxylate these carboxylic 

acids subterminally, especially at the positions co-1, (o-2 and/or 
co-3 . 



We have found that this object is achieved by providing modified 
40 cytochrome P450 monooxygenases which, owing to a combination of 
directed evolution and site-specific mutagenesis of their 
substrate-binding region, show an altered reactivity pattern or 
substrate profile in the terminal and/or subterminal enzymatic 
hydroxylation of aliphatic carboxylic acids in comparison with 
45 the wild type. 
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Detailed descripliion of the invention 

Compared to the wild- type enzyme, an "altered substrate profile" 
is observed in mutants according to the invention. An "altered 
substrate profile" means, for the purposes of the present 
invention, a) an improved reactivity such as, for example, an 
increase in specific activity (expressed as nmol reacted 
carboxylic acid/minute/nmol P450-en2yme) and/or at least one 
kinetic parameter selected from amongst Kcat, Km and Kcat/Km, for 
example an improvement of at least 1%, such as, for example, 10 
to 1000%, 10 to 500%, or 10 to 100%, of the mutant for at least 
one hydroxylatable aliphatic carboxylic acid or a hydroxylatable 
derivative of an aliphatic carboxylic acid, and/or b) an altered, 
in particular increased, regioselectivity in the hydroxylation of 
the carboxylic acid. There is thus, for example, a shift of the 
preferred terminal or subterminal (to-l, a>--2, co-3, co-4 , in 
particular a)-l to a)-3) hydroxylation position at at least one 
hydroxylatable carboxylic acid or a hydroxylatable derivative of 
an aliphatic carboxylic acid. Hydroxylatable aliphatic carboxylic 
acids or derivatives thereof in which an "altered substrate 
profile" can be observed in accordance with the invention are 
branched or, preferably, straight-chain carboxylic acids having 8 
to 30 carbon atoms. The alteration according to the invention of 
the substrate profile may be manifested over the entire length 
(i.e. C8-C30) or only in sections, for example in the case of 
C8-C12-, C10-C12-/ C12-C30-/ C12-C25- or C12-C20 carboxylic acids or 
individual carboxylic acids from amongst these sections. 

Nonlimiting examples which may be mentioned of carboxylic acids 
30 which can be hydroxylated in accordance with the invention are: 
caprylic acid, pelargonic acid, capric acid, undecanoic acid, 
lauric acid, tridecanoic acid, myristic acid, pentadecanoic acid, 
palmitic acid, margaric acid, stearic acid, nonadecanoic acid, 
arachidic acid, behenic acid, lignoceric acid, cerotic acid and 
35 melissic acid. Examples of suitable carboxylic acid derivatives 
are Ci-C4-alkyl esters, amides or anhydrides with, preferably, 
short-chain Ci-C4-carboxylic acids. 

The monooxygenases according to the invention are preferably 
40 derived from cytochrome P450 monooxygenases from the enzyme class 
E.G. 1.14.-.-, in particular from the P450 family CYP102, and are 
of eukaryotic or prokaryotic, in particular bacterial, origin. 

An especially preferred group of mutants is derived from Bacillus 
45 inegateriL2i7i cytochrome P450 monooxygenase BM-3 with an amino acid 
sequence as shown in SEQ ID NO: 2, which has at least one 
functional mutation in one of the following amino acid sequence 



\^.Lh ^Ji -j1 -uAi "5 ^ ... IH;:'! 

0050/51241 

3 

regions: 24-28, 45-51, 70-72, 73-82 (helix 5), 86-88 (helix 6), 
172-224 (F/G loop) and 352-356 (P-strand 8), with the proviso 
that, if the enzyme carries the mutation F87A, more than one of 
these regions is mutated; and functional equivalents of these 
5 mutants. 

In the present description, the mutation in question is shown in 
the amino acid one-letter code. The original amino acid is shown 
before the number which indicates the sequence position of the 
10 mutation, while the modified amino acid is shown after the 
number . 

A "functional mutation" for the purposes of the present invention 
encompasses an amino acid exchange in the abovementioned sequence 
15 regions which results in an "altered reactivity pattern" or 
"altered substrate profile" in accordance with the above 
definition - 

Especially preferred in accordance with the invention are P450 
20 BM-3 monooxygenase mutants (group (A)) which comprise 

individually or in combination in at least each case one 
functional mutation in the cimino acid sequence regions 86-88, 
172-224, 73-82, 45-51 and 24-28 (in accordance with SEQ ID NO: 
2). 

25 

Thus, for example, Phe87 may be replaced by an amino acid with an 
aliphatic side chain such as, for example, Ala, Val, Leu, in 
particular Val or Ala; Leul88 may be replaced by an amino acid 
with an amine or amide side chain such as, for example, Asn, Gin, 

30 Arg or, in particular, Lys, or amino acids such as Ala, Gly, Ser 
and Trp; Ala74 may be replaced by another amino acid with an 
aliphatic side chain, such as, for example, Val and, in 
particular, Gly; Arg4 7 may be replaced by an amino acid with a 
cyclic side group such as, for example. His, Tyr or, in 

35 particular, Phe; and Val2 6 can be replaced by an amino acid with 
a hydroxyl side group such as, for example, Ser or, in 
particular, Thr. 

Preferred group (A) mutants exhibit at least one of the following 
40 amino acid substitution patterns: 



a) F87V; 

b) F87A, L188K; 

c) F87V, L188K; 

45 d) F87A, L188K; A74G; 

e) F87V, L188K, A74G; 

f) F87A, L188K, A74G, R47F; 
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g) F87V, L188K, A74G, R47F; 

h) F87A, L188K, A74G, R47F, V26T; or 

i) F87V, L188K, A74G, R47F, V26T; 



5 and functional equivalents thereof, 

A further group, group (B) , of suitable mutants exhibits a single 
amino acid substitution in one of the abovementioned sequence 
regions or in one of the sequence regions 70-72 and 352-356. In 
10 the two last-mentioned sequence regions, Ser72, for example, may- 
be replaced by an amino acid with an aliphatic side chain such 
as, for example, Ala, Val, Leu, lie and, in particular, Gly, and 
Met354 may be replaced by an amino acid with a hydroxyl side 
group such as, for example, Ser or, in particular, Thr. 

15 

In particular, mutants of group B exhibit an cimino acid 
substitution from amongst the following: 



a) V26T, 

20 b) R47F, 

c) S72G, 

d) A74G, 

e) F87V, 

f ) Ll88z, where z is K, R, W, Q, N, G, A or S, 
25 and 

g) M354T; 



and functional equivalents thereof. 

30 "Functional equivalents'' are to be understood as meaning in 

accordance with the invention mutants which exhibit, in at least 
one of the abovementioned sequence positions, an amino acid 
substitution other than the one mentioned specifically, but still 
lead to a mutant which, like the mutant which has been mentioned 

35 specifically, show an "altered substrate profile" in accordance 
with the above definition over the wild type. Functional 
equivalence exists in particular also in the case where the 
alternations in the reactivity pattern correspond qualitatively. 

40 ''Functional equivalents" naturally also encompass P450 

monooxygenase mutants which, like the P450 BM3 mutants which have 
been mentioned specifically, can be obtained by mutating P450 
enzymes from other organisms. For example, regions of homologous 
sequence regions can be identified by sequence comparison. 

45 Following the principles of what has been set out specifically in 
the invention, the modern methods of molecular modeling then 
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allow equivalent mutations to be carried out which affect the 
reaction pattern. 

"Functional equivalents" also encompass the mutants which can be 
5 obtained by one or more additional amino acid additions, 

substitutions, deletions and/or inversions, it being possible for 
the abovementioned additional alterations to occur in any 
sequence position as long as they give rise to a mutant with an 
"altered substrate profile" in the above sense. 

10 

The present invention furthermore relates to nucleic acid 
sequences encoding a mutated monooxygenase or a "functional 
equivalent" in accordance with the above definition. These 
sequences can preferably be obtained from SEQ ID NO: 1 by codon 
15 exchange in accordance with the above conino acid substitution 
patterns . 

The invention also encompasses those nucleic acid sequences which 
encompass so-called silent mutations or which are altered in 

20 comparison with a specifically mentioned sequence in accordance 
with the codon usage of a specific origin or host organism, and 
naturally occurring variants of such nucleic acid sequences . Also 
encompassed by the invention are deviations of the nucleic acid 
sequences obtained by the degeneracy of the genetic code (i.e. 

25 without altering the corresponding amino acid sequence) or by 
conservative nucleotide substitution (i.e. the corresponding 
amino acid is replaced by another amino acid with the Scune 
charge, size, polarity and/or solubility), and also sequences 
altered by the addition, insertion, inversion or deletion of 

30 nucleotides and which encode a monooxygenase according to the 
invention with an "altered substrate profile", and the 
corresponding complementary sequences* 

The invention furthermore relates to expression constructs 
35 comprising a nucleic acid sequence encoding a mutant according to 
the invention under the genetic control of regulatory nucleic 
acid sequences; and vectors encompassing at least one of these 
expression constructs . 

40 Preferably, the constructs according to the invention encompass a 
promoter 5 '-upstream and a terminator sequence 3 ' -downstream of 
the encoding sequence in question, and, if appropriate, other 
customary regulatory elements, in each case operatively linked 
with the encoding sequence. Operative linkage is to be understood 

45 as meaning the sequential arrangement of promoter, encoding 
sequence, terminator and, if appropriate, other regulatory 
elements in such a manner that each of the regulatory elements 
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can fulfill its intended function on expression of the encoding 
sequence. Examples of operatively linkable sequences are 
targeting sequences, or else translation enhancers, 
polyadenylation signals, selection markers, amplification 
5 signals, replication origins and the like. 

In addition to the artificial regulatory sequences, the natural 
regulatory sequence can still be present upstream of the actual 
structural gene. If desired, this natural regulation may also be 

10 switched off by genetic alteration, and the expression of the 

genes may be enhanced or lowered- However, the gene construct may 
also be simpler in construction, i.e. no additional regulatory 
signals are inserted upstream of the structural gene and the 
natural promoter with its regulation is not removed. Instead, the 

15 natural regulatory sequence is mutated in such a way that 
regulation no longer takes place and the gene expression is 
increased or reduced. One or more copies of the nucleic acid 
sequences may be present in the gene construct. 

20 Examples of promoters are: cos, tac, trp, tet, trp-tet, Ipp, lac, 
Ipp-lac, laclq, T7 , T5, T3, gal, trc, ara, SP6, 1-PR or 1-PL 
promoter, all of which are advantageously employed in 
Grcun- negative bacteria; and the Grcim-positive promoters cimy and 
SP02, the yeast promoters ADCl, MFa, AC, P-60, CYCl, GAPDH or the 

25 plant promoters CaMV/35S, SSU, OCS, lib4, usp, STLSl, B33, nos, 
or the ubiquitin or phaseolin promoter. 

In principle, all natural promoters with their regulatory 
sequences can be used. In addition, synthetic promoters may also 
30 be used in an advantageous fashion. 

The abovementioned regulatory sequences are intended to allow the 
directed expression of the nucleic acid sequences. Depending on 
the host organism, this may mean, for example, that the gene is 
35 expressed or overexpressed only after induction has taken placed, 
or that it is expressed and/or overexpressed immediately. 

The regulatory sequences or factors can preferably have a 
positive effect on expression and in this manner increase or 
40 reduce the latter. Thus, an enhancement of the regulatory 

elements may advantageously take place at the transcriptional 
level by using strong transcription signals such as promoters 
and/or "enhancers". In addition, translation may also be enhanced 
by improving, for example, mRNA stability. 



45 
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An expression cassette is generated by fusing a suitable promoter 
with a suitable monooxygenase nucleotide sequence and a 
terminator signal or polyadenylation signal- To this end, 
customary recombination and cloning techniques are used as they 
5 are described, for example, by T, Maniatis, E.F. Fritsch and J. 
Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1989) and by T.J. 
Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene 
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
10 (1984) and by Ausubel, F.M. et al.. Current Protocols in 
Molecular Biology, Greene Publishing Assoc. and Wiley 
Interscience ( 1987 ) . 

For expression in a suitable host organism, the recombinant 
15 nucleic acid construct or gene construct is advantageously 

inserted into a host-specific vector which allows optimal gene 
expression in the host. Vectors are well known to the skilled 
worker and can be found, for example, in "Cloning Vectors" 
(Pouwels P. H. et al., Ed., Elsevier, Amsterdam-New York-Oxford, 
20 1985). Vectors are to be understood as meaning not only plasmids, 
but all other vectors known to the skilled worker such as, for 
example, phages, viruses such as SV40, CMV, baculovirus and 
adenovirus, transposons, IS elements, phasmids, cosmids, and 
linear or circular DNA. These vectors can be replicated 
25 autonomously in the host organism or chromosomally . 

The vectors according to the invention allow the generation of 
recombinant microorganisms which can be transformed, for example, 
with at least one vector according to the invention and which can 

30 be employed for producing the mutants. The above-described 
recombinant constructs according to the invention can 
advantageously be introduced into a suitable host system and 
expressed. It is preferred to use usual cloning and transfection 
methods known to the skilled worker, for example coprecipitation, 

35 protoplast fusion, electroporation, retroviral transfection and 
the like, in order to bring about expression of the 
abovementioned nucleic acids in the expression system in 
question. Suitable systems are described, for example, in Current 
Protocols in Molecular Biology, F. Ausubel et al., Ed., Wiley 

40 Interscience, New York 1997. 

Suitable host organisms are, in principle, all organisms which 
allow expression of the nucleic acids according to the invention, 
their allelic variants, and their functional equivalents or 
45 derivatives. Host organisms are to be understood as meaning, for 
example, bacteria, fungi, yeasts or plant or animal cells. 
Preferred organisms are bacteria such as those of the genera 
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Escherichia such as, for example, Escherichia coli, Streptomyces , 
Bacillus or Pseudomonas , eukaryotic microorganisms such as 
Saccharomyces cerevisiae, Aspergillus, and higher eukaryotic 
cells from animals or plants, for example Sf9 or CHO cells. 

5 

If desired, expression of the gene product may also be brought 
about in transgenic organisms such as transgenic animals such as, 
in particular, mice, sheep, or transgenic plants. The transgenic 
organisms may also be knock-out animals or plants in which the 
10 corresponding endogenous gene has been eliminated, such as, for 
excimple, by mutation or partial or complete deletion. 

Successfully transformed organisms may be selected by means of 
marker genes which are also contained in the vector or in the 

15 expression cassette. Examples of such marker genes are genes for 
resistance to antibiotics and genes for enzymes which catalyze a 
color reaction causing the transformed cell to be colored. They 
may then be selected by means of automatic cell sorting - 
Microorganisms which have been transformed successfully with a 

20 vector and which carry a suitable gene for resistance to 

antibiotics (for example G418 or hygromycin) may be selected by 
suitable liquid or solid media containing antibiotics . Marker 
proteins presented on the cell surface may be exploited for 
selection by means of affinity chromatography - 

25 

The combination of the host organisms and the vectors which match 
the organisms , such as plasmids , viruses or phages , for example 
plasmids with the RNA polymerase/promoter system, the phages X, jjl 
or other temperate phages or transposons and/or further 
30 advantageous regulatory sequences, forms an expression system. 
The term ''expression system" is to be understood as meaning, for 
exaunfiple, the combination of mammalian cells, such as CHO cells, 
and vectors, such as pcDNA3neo vector, which are suitable for 
mammalian cells. 

35 

As described above, the gene product may advantageously also be 
expressed in transgenic animals, for example mice, sheep, or 
transgenic plants. Equally, it is possible to program cell-free 
translation systems with the RNA derived from the nucleic acid. 

40 

The invention relates furthermore to processes for the 
preparation of a monooxygenase according to the invention, in 
which a monooxygenase -producing microorganism is cultured, 
monooxygenase expression is induced, if appropriate, and the 
45 monooxygenase is isolated from the culture. In this manner, the 
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monooxygenase according to the invention may also be produced on 
an industrial scale, if this is desired. 



The microorganism can be cultured and fermented by known methods. 
5 Bacteria^ for example, can be multiplied in TB or LB medium at a 
temperature of 2 0 to 4 0^0 and a pH of 6 to 9. Suitable culture 
conditions are described in detail for example by T. Maniatis, 
E.F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
10 (1989). 



Unless the monooxygenase is secreted into the culture medium, the 
cells are then interrupted, and the monooxygenase is obtained 
from the lysate by known protein isolation methods. The cells may 
15 be disrupted as desired by high-frequency ultrasound, by high 
pressure, such as, for example, in a French pressure cell, by 
osmolysis, by the action of detergents, lytic enzymes or organic 
solvents, by homogenizers , or by combining a plurality of the 
methods listed. 

20 

The monooxygenase may be purified by known methods of 
chromatography, such as molecular sieve chromatography (gel 
filtration), such as chromatography on Q-Sepharose, ion-exchange 
chromatography and hydrophobic chromatography, and by other 

25 customary methods such as ultrafiltration, crystallization, 

salting out, dialysis and native gel electrophoresis- Suitable 
methods are described, for example, by Cooper, F.G., Biochemische 
Arbeitsmethoden [Methods in Biochemistry], Verlag Walter de 
Gruyter, Berlin, New York or by Scopes, R. , Protein Purification, 

30 Springer Verlag, New York, Heidelberg, Berlin. 



To isolate the recombinant protein, it is especially advantageous 
to use vector systems or oligonucleotides which extend the cDNA 
by certain nucleotide sequences and thus encode altered 

35 polypeptides or fusion proteins for the purposes of simpler 
purification- Such modifications which are suitable are, for 
example, tags which act as anchors, for example the modification 
known as hexa-histidine anchor or epitopes which can be 
recognized as antigens of antibodies (described, for example, by 

40 Harlow, E. and Lane, D., 1988, Antibodies: A Laboratory Manual. 
Cold Spring Harbor (N.Y. ) Press). These anchors can serve to 
attach the proteins to a solid support such as, for example, a 
polymer matrix, which may be filled into a chromatography column 
or used on a microtiter plate or on any other carrier, for 

45 excimple. 
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At the same time^ these anchors may also be used for recognizing 
the proteins. To recognize the proteins, it is additionally 
possible to use conventional labels, such as fluorescent dyes, 
enzyme labels which, after reaction with a substrate, form a 
5 detectable reaction product, or radiolabels, alone or in 
combination with the anchors for derivatizing the proteins. 

The invention furthermore relates to biochemical processes for 
the enzymatic production of terminally or subterminally (co-l to 
10 co-4) hydroxylated aliphatic carboxylic acids, which comprise 

al) aerobically culturing a recombinant microorganism according 
to the invention in the presence of a culture medium which 
contains at least one hydroxylatable carboxylic acid or at 
15 least one hydroxylatable carboxylic acid derivative; or 

a2) aerobically incubating a reaction medium containing at least 
one hydroxylatable carboxylic acid or at least one 
hydroxylatable carboxylic acid derivative with a mutant 
20 according to the invention, and 

b) isolating the resulting hydroxylated product or product 
mixture from the medium. 



25 The carboxylic acids can be employed in the process according to 
the invention per se or as derivatives thereof, such as, in 
particular, Ci-C4-alkyl esters or carboxamides . 

Hydroxylatable carboxylic acids which are preferably employed in 
30 the process according to the invention are Cg-Cao-monocarboxylic 
acids or derivatives thereof. 



A hydroxylatable carboxylic acid which is especially preferably 
used in the process according to the invention is a 
35 C8-Ci2-monocarboxylic acid or a derivative thereof, and a 

monooxygenase which is especially preferably used in the process 
according to the invention is a mutant in accordance with the 
above group (A) . 

40 In accordance with another process according to the invention, 
the hydroxylatable carboxylic acid used is a 
Ci2-C3o-monocarboxylic acid or a derivative thereof and the 
monooxygenase is a mutant selected from amongst the single 
mutants F87A, F87V, V26T, S72G, A74G and M354T, and the multiple 

45 mutants f) to i) in accordance with the above group (A). 
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The oxidation reaction according to the invention is usually 
carried out in the presence of atmospheric oxygen and in the 
presence of an electron donor (or reduction equivalent) such as, 
in particular^ NADH, NADPH and Zn/Co(III) sepulchrate. The use 
5 of Zn/Co(III) sepulchrate is described, for example, in 
DE-A-199 35 115 which is herewith expressly referred to. 

If the hydroxy lat ion according to the invention is carried out 
with a recombinant microorganism, it is preferred first to 

10 culture the microorganisms in the presence of oxygen and in a 
complex medium such as, for example, TB or LB medium, at a 
cultivation temperature of approximately 20 to 40^C and a pH of 
approximately 6 to 9 until a sufficient cell density is reached. 
Exogenous substrate is added as required. In order better to 

15 control the oxidation reaction, the use of an inducible, in 

particular a temperature-inducible, promoter is preferred. Here, 
the temperature is raised to the induction temperature required, 
for example 42^0 in the case of the PrPi promoter, maintained over 
a sufficient period, for example 1 to 10 or 5 to 6 hours, to 

20 express the monooxygenase activity, and the temperature is then 
dropped to a value of approximately 30 to 40^0 . Culturing is then 
continued for 12 hours to 3 days in the presence of oxygen. 

If, in contrast, the oxidation according to the invention is 
25 carried out with purified or concentrated enzyme, the enzyme 
mutant according to the invention is dissolved in a medium 
comprising exogenous substrate (approximately 0.01 to 10 mM, or 
0.05 to 5 mM), and the reaction is carried out, preferably in the 
presence of oxygen, at a temperature of approximately 10 to SO^C, 
30 such as, for example, 30 to 40°C, and a pH of approximately 6 to 9 
(such as, for example, adjusted with 100 to 200 mM phosphate or 
Tris buffer), and in the presence of a reducing agent, during 
which process the substrate-comprising medium furthermore may 
exhibit an approximately 1 to 100-fold molar excess of reduction 
35 equivalents, based on the substrate to be oxidized. If required, 
the reducing agent may be added portionwise. 

If required, oxygen may be passed into the reaction medium in a 
manner known per se. 



If the processes according to the invention are carried out with 
enriched or purified enzyme, for example the following reaction 
conditions may be set: 



40 



45 Substrate concentration: 
Enzyme concentration : 
Reaction temperature: 



0.1 to 2 0 mg/ml 
0.1 to 10 mg/ml 
20 to 40OC 
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Electron donor: 



pH: 

Buffer : 



6 to 8 

0-05 to 0.2 M potassium 
phosphate , Tr is /HCl 
is preferably added portionwise 
( initial concentration 
approx. 0.1 to 2 mg/ml). 



Before the reaction is started by adding the electron donor, it 
is possible to increase the activity by adding acetone in a 
10 concentration of 1 to 5% (v/v) and briefly preincubating ( 1 to 5 
minutes at approximately 20 to 40^0 ). 

The skilled worker may deviate from these conditions and optimize 
the reaction conditions in question by routine experiments . 

15 

Two different strategies exist in protein engineering for 
generating variants with modified enzymatic properties: firstly, 
the "rational design" method (4), and secondly, the "directed 
evolution" method (5-7). Of central importance for successful 

20 rational design are a highly resolved three-dimensional 
structural model and an in-depth knowledge of the enzyme 
mechanism. While the rational design has been shown to generate 
enzyme mutants which have a high activity for unnatural 
substrates (4), the effect of individual amino acid point 

25 substitutions on stability, activity and specificity of the 
mutants can frequently not be predicted. 

Even through X-ray structures of P450 BM-3 have been deposited in 
the protein database (8)^ the conformation of substrate and P4 50 

30 during the critical hydroxylation stage remains unclear (9-11). 
Of central importance for the directed-evolution method is an 
efficient detection method for the rapid screening of large 
libraries of random mutants. An optical test for the P450 BM-3 
mutant F87A, which employes co-para-nitrophenoxycarboxylic acids 

35 (pNCA), has proved to be useful (12, 13). The single mutant F87A 
shifts the hydroxylation position of fatty acids from position 
co-1, (jL)-2 and a)-3 towards the co-position (13) and additionally 
leads to a complete conversion of 12-pNCA compared with a 33% 
conversion observed for the wild-type enzyme (12). However, since 

40 the monooxygenase domain of P450 BM-3 encompasses over 400 amino 
acids, the screening of a random-mutant library is like looking 
for a needle in a haystack. 

To improve the efficiency of the searching procedure, the 
45 rational-design method was combined in accordance with the 
directed-evolution method so as to generate mutants with a 
modified reactivity, in particular for generating mutants with 
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the specificity for fatty acids with medium chain length. The 
"rational evolution" method according to the invention is based 
on a computer-aided protein modeling to generate a virtual random 
library and to identify residues which are highly likely to 
5 affect the desired properties- A sublibrary is generated by 
randomizing these residues and screened for positive mutants. 
Starting from a structural model, the first step is the 
determination of residues which are potentially important for 
chain-length specificity. To generate mutants with improved 

10 properties, the mutation site proposed by the model is modified 
with the aid of the saturation mutagens. Then, the individual 
mutants with the best properties are combined with each other 
taking into consideration the rational design. The application of 
this combined strategy made it possible in particular to improve 

15 the activity of P450 BM-3 for substrates of medium chain length. 

The application of this strategy and the sequence homology to 
P450 enzymes of different origins allow the skilled worker 
analogously to generate other mutants which are also the subject 
20 of the invention. 

The present invention will now be illustrated by the following 
experiment description and taking into consideration the appended 
figures. The figures show: 

25 

Figure 1: the stepwise optimization of P450 BM-3 for the new 
substrate 8-pNCA. The catalytic efficiency over 8-pNCA is shown 
for each mutant (in the unit s-^M"^); and 

30 Figure 2: a model of the complex of the P450 BM-3-mutant LARVF 
with the substrate 8-pNCA- The side chains which show the five 
best positions for mutations and which were combined to give the 
mutant LARVF are shown {V26T, R47F, A74G, F87V, L188K) . 

35 Method 1; Modeling the substrate/P45Q BM-3 complex 

The modeling of the substrate/enzyme complex was based on the 
structure of P450 BM-3 which had been determined by 
crystallography, complexed with palmitoleic acid (9). This 

40 structure is deposited at the protein database (8). The crystal 
structure described, at a resolution of 2.9 A, contains four 
molecules in an asymmetric unit cell. Chain A was selected as 
reference for the modeled complex. A model of 8-pNCA was 
generated as substrate molecule using the SYBYL "Molecule 

45 Builder" (Tripos, Inc., St. Louis, USA). The F87A mutation was 
generated using the SYBYL Biopolymer Tool. The carbon atoms 1 to 
4 of the substrate were placed into the binding site matching the 
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carbon atoms 6 to 9 of the bound palmitoleic acid. The torsion 
angle of the fatty acid chain was chosen in accordance with the 
transconfiguration step. NMR studies on P450 BM-3/laurate 
complexes and P450 BM-3/12-bromolaurate complexes demonstrated 
5 that the protons of the hydroxy lated carbon atoms (CIO and Cll) 
are at an approximate distance of 3,0 A from the heme iron, which 
is in the II oxidation state (10, 13). As a result of this 
finding, the corresponding atoms C7 and C8 of 8-pNCA were placed 
at a distance of 4 A and 3.6 A, respectively, from the heme iron. 
10 The para-nitrophenoxy group was arranged manually in the binding 
pocket. Also, the energy of the complex was minimized by fixed 
backbone atoms . 

Method 2; Saturation mutagenesis 

15 

The mutants used in accordance with the invention were generated 
with the aid of saturation mutagenesis using the Stratagene 
QuikChange Kit (La Jolla, California, USA) . Nine positions in the 
vicinity of the substrate binding channel, viz. P25, V26, R47, 
20 YSl, S72, A74, F87, L188 and M354, were selected for mutation via 
P4 50 BM-3 modeling. The primers for each of these positions are 
compiled in Table 1 . 



Table 1 : 



25 



35 



40 



Selec- 
ted 
posi- 
tions 


Primers 


Sequence No. 


P25 


5' 


-gttattaaacacagataaannngttcaagctttgatg-3 ' 


SEQ 


ID 


NO: 


9 




5' 


-catcaaagcttgaacnnntttatctgtgtttaataac-3 ' 


SEQ 


ID 


NO: 


10 


V26 


5' 


-gttattaaacacagataaaccgnnncaagctttgatg-3 ' 


SEQ 


ID 


NO: 


11 




5' 


-catcaaagcttgnnncggtttatctgtgtttaataac-3 ' 


SEQ 


ID 


NO: 


12 


R47 


5' 


-cgaggcgcctggtnnngtaacgcgctacttatc-3 ' 


SEQ 


ID 


NO: 


13 




5 ' 


-gataagtagcgcgttacnnnaccaggcgcctcg-3 ' 


SEQ 


ID 


NO: 


14 


YSl 


5 ' 


-cctggtcgtgtaacgcgcnnnttatcaagtcagc-3 ' 


SEQ 


ID 


NO: 


15 




5 ' 


-gctgacttgataannngcgcgttacacgaccagg-3 ' 


SEQ 


ID 


NO: 


16 


372 


5' 


-gctttgataaaaacttannncaagcgcttaaatttgtacg-3 ' 


SEQ 


ID 


NO: 


17 




5 ' 


-cgtacaaatttaagcgcttgnnntaagtttttatcaaagc-3 ' 


SEQ 


ID 


NO: 


18 


A7 4 


5' 


-gctttgataaaaacttaaagtcaannncttaaatttgtacg-3 ' 


SEQ 


ID 


NO: 


7 




5' 


-cgtacaaatttaagnnnttgacttaagtttttatcaaagc-3 ' 


SEQ 


ID 


NO: 


8 
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L188 


5' 


-gaagcaat:gaacaagnnncagcgagcaaatccag-3 ' 


SEQ 


ID 


NO: 


5 






5 ' 


-ctggatttgctcgctgnnncttgttcattgcttc-3 ' 


SEQ 


ID 


NO: 


6 




M354 


5' 


-ggcgacgaactannngttctgattcctcag-3 ' 


SEQ 


ID 


NO: 


19 


5 




5' 


-ctgaggaatcagaacnnntagttcgtcgcc-3 ' 


SEQ 


ID 


NO: 


20 




F87 


5' 


-gcaggagacggggttgnnnacaagctggacg-3 ' 


SEQ 


ID 


NO: 


3 






5' 


-cgtccagcttgtnnncaaccccgtctcctgc-3 ' 


SEQ 


ID 


NO: 


4 



10 

The reaction conditions were identical for all mutagenic PGR 
processes, with the exception of the annealing temperature, which 
was varied as follows: 50°C for positions 25, 26, 188 and 354; 
520C for positions 47 and 51; and 46°C for positions 72, 74 and 

15 87. The reactions were carried out in reaction volumes of 50 pil, 
each batch containing 17.5 pmol of each primer, 20 pmol of the 
template plasmid DNA, 3 U of the Pfu polymerase and 3.2 5 nmol of 
each dNTP. The reaction was started at 95^0, 4 minutes, and the 
batch then undergoes the following thermocycle 20 times: 95^0, 

20 1 minute; 46-52°C, 2.5 minutes; 72°C, 17 minutes; after these 

20 cycles, the reaction was continued for 15 minutes at 72^0 . To 
carry out a site-specific mutagenesis by PGR, an individual codon 
was altered for exchanging one aimino acid. To randomize a 
specific amino acid, primers were used in which ''nnn" encodes the 

25 specific amino acid. All PGR product solutions were treated for 
3 hours at 37^C with 2 0 U Dpnl to digest the original unmutated 
template DNA- Transformation into E. coli DH5a was then carried 
out . 

30 Method 3: Expression and purification of wild-type enzyme and of 
the mutants 

The P4 50 BM-3 wild-type gene and its mutants wee expressed under 
the control of the strong temperature-inducible PrPl promoter 

35 pGYTEXPl into E. coli strain DH5a (supE44, lacU169 [SOlacZ M15] 
hsdR17 recAl endAl gyrA96 thi-1 relAl). The single point mutation 
F87A was introduced in accordance with the prior art (12). The 
transformed cells were plated onto LB agar plates which contained 
100 pig/ml ampicillin. After the colonies had grown for 12 to 

40 24 hours, they were picked up with sterile toothpicks and placed 
into 96-well microtiter plates, each well containing 200 [il of TB 
medium (12 g tryptophan, 24 g yeast extract, 4 ml glycerol, 
(distilled) H2O to 1 liter) together with 100 jxg/ml ampicillin. 
The plates were incubated overnight at 37°C. 40 \il were 

45 subsequently removed from each well and transferred into culture 
tubes which contained 2 ml TB medium with 100 \xg/ml ampicillin, 
and the batches were subsequently incubated for 2 hours at 37oG 
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and then for 6 hours at 42^0- The cells Were removed by 
centrifugation for 5 minutes at 4000 rpm, treated with chicken 
albumen lysozyme (1 U/ml), and then twice frozen and defrosted. 
The crude cell extracts were obtained by centrifugation for 
5 10 minutes at 14,000 rpm. The activity was measured in the 

resulting supernatant. To produce large quantities of enzyme, a 
2-1 shake flask with 300 ml of TB medium with 100 \xg/ml 
cimpicillin was used, incubated at 37°C, shaken for 2 hours at 
200 rpm (ODsysnm = 0-8 to 1.0) and subsequently incubated for 

10 6 hours at 42°C. The cells were collected by centrifugation for 
10 minutes at 4000 rpm and suspended in 15 ml of 0-1 M potassium 
phosphate buffer, pH 7.4. The ice-cooled suspension was disrupted 
with the aid of a Branson sonifier W25 ( Dietzenbach, Germany) 
(80 W, 2 minutes, 3 cycles). The suspension was centrifuged for 

15 20 minutes at 32570 x g. The crude extracts were used for 
determining the enzyme activity or for enzyme purification - 

Enzyme purification was carried out as described in (14), but 
using a BioPilot chromatography (Pharmacia, Sweden). Enzyme 

20 purity was determined by determining the total protein and the 
amount of enzyme. The concentration of purified enzyme was 
determined from the difference between the absorption spectrum of 
the carbonyl complex of the iron(II) form in comparison with the 
iron(II) form using a molecular absorptivity of 91 mM-i cm-^ for 

25 the wavelength pair 450 nm and 490 nm (1). 

Method 4: Isolation of mutants with higher activity for 
substrates of shorter chain length 

30 Instead of the wild type, the P450 BM-3 mutant F87A was used as 
template DNA. The mutations for the position in question were 
generated as described above. In each case approximately 100 
colonies were picked for screening the sequence zone at each 
position and grown in culture tubes, and cells were isolated 

35 therefrom and lysed. The crude cell extracts from each selected 
colony were used for the activity test. All mutants which showed 
a higher activity than F87A for at least one substrate with a 
shorter chain length than 15-pNCA was sequenced for identifying 
the mutations - 

40 

The mutant with the highest activity for 12-pNCA, 10~pNCA or 
8-pNCA, respectively, amongst all mutants for the same position 
was selected for a later combination with other mutations. The 
combined mutation was carried out stepwise by site-specific 
45 mutagenesis. The combinatory procedure is shown in Figure 1. Six 
colonies were isolated from each combination step to determine 
the substrate specificity. A colony with a representative 
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substrate specificity was selected^ and the substrate specificity 
was determined for the pure enzyme. The plasmid of the selected 
colony was used for the next step of the site-specific 
mutagenesis. The mutations in the final mutant were identified by 
5 DNA sequencing (ABI PRISM^ BigDye™ Terminator Cycle Sequencing 
Ready Reaction Kit and ABI Prism™ 377 dna Sequencer) . 

Method 5; Enzyme activity assay 

10 For the pNCA activity assay use was made of 8 |j,1 of a 10 mM pNCA 
solution in DMSO in a disposable cell in an end volume of 1 ml. 
After 850 ^1 of Tris/HCl buffer (0.1 M, pH 8.2) and 0.1 to 
0.4 nmol P450 had been added to the pNCA/DMSO solution in 
question, the samples were preincubated for 5 minutes before the 

15 reaction was started by adding 50 \il of an aqueous solution of 
1 mN NADPH. To determine Kcat and Km, a concentration series was 
established for the various pNCA substrates (regarding details of 
the detection method, see (12)). 

20 A 96-well plate (Greiner, Frickenhausen, Germany) was used for 
carrying out the pNCA test in a microtiter plate. In a total 
reaction volume of 2 50 \il in Tris/HCl buffer (0.1 M, pH 8.2), the 
reaction batch contained either 60 nmol 8-pNCA, 15 nmol 10-pNCA, 
15 nmol 12-pNCA or 15 nmol 15-pNCA, in each case dissolved in 

25 2.5 fxl of DMSO. After the samples had been preincubated for 
5 minutes with 4 0 jxl of P450 BM-3 samples, the reaction was 
started by injecting, into each well, 30 ^1 of a 1 mM NADPH 
solution. Immediately after the NADPH solution had been added, 
the plates were measured in a plate reader at 405 nm. 



Method 6; Conversion of carboxylic acids using mutants according 
to the invention, and identification of the products 

a) Chemical reaction 



The following batch was chosen to hydroxylate the fatty acids in 
the presence of P450 BM-3 or its mutants: 



30 



35 



P450 BM-3 mutant 



2 0 mg 

20 ml (Tris/HCl 50mM, KCl 250mM, pH 7.8) 

10 mg 

400 txl (2 % v/v) (accelerates the 



40 Reaction buffer 
Fatty acid 
Acetone 



reaction by a factor of 2 ) 



45 
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Prior to the reaction, the enzyme lyophilisate was dissolved in 
1 ml of reaction buffer and first incubated for 3 0 minutes at 
36^0 . Like the addition of 2% v/v acetone, the incubation leads to 
an increase in activity of 60 and 75%, respectively. 

5 

After 5 minutes' incubation, 500 \xl of the previously prepared 
NADPH solution (12.5 mg/ml) were added. The course of the 
reaction is monitored by absorption measurements at 340 nm, and 
the consumption of NADPH can be observed. Theoretically 4 ml of 
10 NADPH solution would be required for a stoichiometric conversion, 
however, an unduly high NADPH concentration in the reaction 
solution results in inactivation of the enzyme, which is why the 
cof actor is added in 500 \il steps. 

15 After the reaction had ended, the solution was acidified with 5M 
hydrochloric acid to a pH of 2 . The fatty acids or hydroxylated 
fatty acids precipitate during this and result in a noticeable 
cloudiness of the solution. Thereafter, the mixture was extracted 
twice with in each case 10 ml of dichloromethane and dried over 

20 sodium sulfate. After filtration through a folded filter, the 
dichloromethane was removed on a rotary evaporator or by 
evaporation with nitrogen. The white solid which remained was 
taken up in 2 ml of dichloromethane and used in the GC analysis. 

25 b) Analysis by gas chromatography 

A Fisons gas chromatograph (Fisons Instruments Mega Series, 
Mainz, Germany) equipped with FID was used for the analysis 
(Optima 5-column, internal diameter 25 m x 0.25 mm, Macherey & 
30 Nagel, Diiren, Germany) . 

For analysis, starting materials and products were silylated with 
MSHFBA- During this, all hydroxyl and acid groups are converted 
into the corresponding trimethylsilyl ethers and esters, 
35 respectively. Care must be taken that the sample is anhydrous 
since otherwise side reactions with the silylating agent result. 
15 ^il of MSHFBA were pipetted into 10 jxl of the 

hydroxy-f atty-acid-containing dichloromethane solution and the 
mixture was incubated for 15 minutes at room temperature. After 
40 25 ^1 of dichloromethane had been added, the GC analysis was 
carried out. 

To this end, 1 \xl of the silylated samples were injected into the 
gas chromatograph. Injector and detector temperature were 350^0. 
45 The following temperature programmes were used: 
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Caprylic acid 

100 (2 min) 5Q/min ^ 200 IQQ/min ^ ^^^^^ 

g Capric acid 

100 oc (2 min) 5°/min ^ (5 min) lQ°/"^J-n^ ^^^^^ 
Laurie acid 

100 (2 min) 5o/min ^ ^^.^^ lOQ/mm ^ 3^^ ^^^^^ 



Example 1 : Selection of -the starting mutant 



]L5 The P450 BM-3 single mutant F87A was used as starting template - 
As already mentioned, this mutation shifts the hydroxylation 
position in saturated C12- and C14-fatty acids of co-1, co-2 and 
(0-3 towards the co-position (13). For the conversion of 12-pNCA 
and 15-pNCA, it was also shown that the co-hydroxylation is 

20 enhanced significantly over the wild type (12). As regards the 
co-hydroxylation of 12-pNCA, a complete conversion was obtained 
compared with a 33% conversion in the case of the wild type. In 
addition to the increased regioselectivity, increased activities 
for 12-pNCA and 15-pNCA were also observed (compare Table 2). 

25 

Table 2: Specific activity of selected P450 BM-3 mutants for 
various pNCA derivatives with different chain lengths ^ 



30 



Sub- 
strate 


WT 


F87A 


L188K 


V2 6T 


R47F 


S72G 


A74G 


M354T 


15-pWCA 


405 


410 


288 


519 


258 


439 


474 


560 


12-pNCA 


141 


284 


316 


555 


233 


596 


517 


480 


10-pNCA 


339 


92 


207 


106 


52 


150 


103 


171 


8-pNCA 


15 


2 


69 


16 


13 


3 


6 


4 



1 Units of specific activity: nmol/min/nmol P450 



The increased regioselectivity of F87A can be understood with 
reference to the structural model. The substitution of the bulky 
residue F87 by alanine enlarges the binding pocket for the 
p-nitrophenoxy group formed by F87, and also by V78, A82, T260, 
1263 and A264. In addition, the access of the large 

p-nitrophenoxy group to the binding pocket is facilitated and the 
steric interactions between F87 and the carbon atoms (o-2 and co-l 
of the pNCA are eliminated. This allows a more advantageous 
orientation of the co-position relative to the heme iron atom. 
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Example 2; Selection of individual mutation positions by 
modeling f site-specific randomization mutagenesis of selected 
positions, and screening 

5 1. Selection of the mutation positions 

The model of the bound C8-pNCA substrate shows that the 
carboxylate is at a distance of 9 and 11 A from the 
carboxylate-binding residues R47 and Y51, respectively. To induce 

10 activity for C8 substrates, it is necessary to generate a new 

binding site which is at a suitable distance from the carboxylate 
group of the substrate. Additional residues should be positioned 
within the binding site to facilitate the binding of 8-pNCA, The 
following residues were selected: the residues R47 and Y51, which 

15 form the original carboxylate binding site. For R47, it is 

proposed to form an ion pair between its guanidinium group and 
the carboxylate residue of the substrate; Y51 forms a hydrogen 
bridge with the carboxylate group of the substrate ( 9 ) . In the 
palmitoleic acid/P450 BM-3 complex, the distance between the Ca 

20 atom of R4 7 and the C atom of the fatty acid carboxylate group is 
12 A. All residues in the hemisphere with a 12 A radius around 
the carboxylate group of the 8-pNCA model were identified, and 
only those were selected within the binding pocket which have 
side chains pointing towards the carboxylate group of the fatty 

25 acid. In this manner, P25, V26, S72, A74, L188 and M354 were 
selected, and these are probably capable of forming a new 
carboxylate binding site for short-chain pNCA compounds. These 
six selected residues are located on secondary structural 
elements with a flexible conformation (11). 

30 

2 . Mutation and screening 

Mutants of each of the eight selected residues were generated by 
site-specific randomization of the wild-type codon at the 

35 positions in question. To ensure that most of the 19 possible 

amino acid species are tested, 100 colonies of each position were 
isolated, cultured and tested for activity (the probability that 
each amino acid is tested is thus greater than 95%, with the 
exception of a probability of 79% for tryptophan and methionin) . 

40 The mutants with a higher activity value for at least one 

substrate with a shorter chain length than 15-pNCA were selected 
for sequencing for identifying the mutation(s). 

It emerged that position 188 is relatively variable. Most of the 
45 100 colonies showed activity for pNCA. From these, 37 colonies 
were selected in accordance with their activity for 8-pNCA. 16 
different amino acid types inclusive of the wild type were 
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detected. This result furthermore confirmed that the selection of 
100 colonies was sufficient to ensure that most of the 20 amino 
acids which are possible can be tested. Amongst the 15 
substitutions, substitutions K, Q, N, G, A and S 

5 significantly increased the catalytic activity for substrates of 
shorter chain lengths . The substitution of L by negatively 
charged amino acids led to an activity for 10-pNCA, 12-pNCA and 
15-pNCA which was reduced by a factor of three to seven. 

10 The remaining seven positions were randomized in a manner similar 
to position 188. 7 to 19 colonies were selected from each 
position for DNA sequencing for detecting the mutation(s). Most 
of these genes were either unmutated, or else the gene product 
showed reduced activity for substrates of shorter chain lengths 

15 (the tests were carried out with pure enzyme) . In each case only 
one mutant of positions 26, 47, 74 and 354, two mutants of 
position 72 and no mutants of positions P2 5 and Y51 showed a 
higher activity for substrates of shorter chain length in 
comparison with 15-pNCA- The mutant with the highest activity for 

20 8-pNCA in position 188 and position 72, and also single mutants 
of positions 26, 47, 74 and 354 were selected for a combination. 
These mutations, which are listed in the above Table 2, contain 
the following substitutions: V26T, R47F, S72G, A74G, L188K and 
M354T. In comparison with F87A, the specific activity for 8-pNCA 

25 was increased by the factor 0.5 to 33.5- Each of these mutations 
was combined stepwise with the original mutation F87A- 

Example 3; Multiple mutants generated by stepwise combination of 
mutants , and their screening 

30 

1. Mutant L (F87A, L188K) 

The selection of the first mutant is critical since the 
combination of single mutants does not necessarily result in a 

35 cumulation of the individual effects. The wrong selection of the 
first mutation might lead towards an evolutionary pathway which 
does not result in an optimal multiple mutation. In the case of a 
large starting pool of single mutants, an enormous combinatory 
diversity is likely and, as a consequence, finding the desired 

40 multiple mutant is unlikely. 

Two criteria favor the selection of mutant L as starting 
material: its increased activity for 8-pNCA compared with all 
other mutants (Table 2), and the changes in the enzyme/ substrate 
45 interaction and the properties of the binding site. To verify 
these changes, structural models of the selected mutants were 
generated by replacing the side chains of the mutation positions 
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with the aid of the SYBYL method. L188 is located on the 
C terminus of the a-helix F. This helix and the vicinal G helices 
experience the greatest shift by substrate binding (9). The model 
illustrates that the exchange of leucine for lysine leads to the 
5 formation of a novel putative carboxylate binding site. The 
distance between the amino group of K188 with the carboxylate 
residue of 8-pNCA is 6 A and thus comparable with the distance of 
the carboxylate group of palmitoleic acid and of the guanidinium 
group from R4 7 (the original binding residue) , which value has 

10 been determined experimentally. The distance between the 

guanidinium group of R47 and the carboxylate group of 8-pNCA is 
11.4 A. It can therefore be proposed that the ion-pair reactions 
between the carboxylate group of the substrate and K188 together 
with the hydroxylated C8 atom take place in a reactive distance 

15 to the heme iron atom. 

The other five mutants were combined stepwise with mutant L by 
site-specific mutagenesis (compare Figure 1). A structural model 
of the mutants selected was generated for each mutagenesis step. 
20 The effects, of the mutants generated, on substrate binding was 
studied with reference to this model, and, on this basis, a 
rational explanation for the optimized properties of the mutants 
and their combinations was proposed. 

25 2. Mutant LA (F87A, L188K, A74G) 

Catalytic efficiency over 8-pNCA was increased by introducing 
mutation A74G into mutant L. A74 is located on the N-terminus of 
the a-helix B'. Since the side chain of A74 sterically interacts 
30 with the amino group of K188, this interaction is eliminated by 
exchanging A74 for glycine, thus allowing a more advantageous 
orientation of the side chain relative to the carboxylate group 
of the substrate. 

35 3. Mutant LAR (F87A, L188K, A74G, R47F) 

The catalytic efficiency for 8-pNCA was successfully improved by 
introducing the additional mutation R47F. The mutation R4 7F had 
two possible effects. Phenylalanine hinders the original 
40 carboxylate binding and enlarges the hydrophobic section, which 
is exposed to the solvent, at the entrance of the binding channel 
formed by the residues Fll, L14, L17, P18, P45 and A191 (compare 
Figure 2). This hydrophobic section is particularly important for 
attaching the substrate (11). 



0050/51241 



23 

4. Mutant LARV (F87A, L188K, A74G, R47F, V26T) 

If the mutation M354T is added to mutant LAR, the Kcat value for 
8-pCNA is reduced. M354T was therefore not involved in further 
5 combination experiments. However, if the mutation V26T is added 
to mutant LAR, the resulting mutant LARV shows a slightly higher 
Kcat value and a slightly lower Km value than LAR, The catalytic 
efficiency for 8-pNCA was increased. If ^ instead, mutation S72G 
is carried out in mutant LAR, the Kcat value of 8-pNCA was 
10 reduced. This is why mutant LARV was selected for further 
mutations . 

V26 is located at the N-terminus of the a-helix A. The model 
suggests that T2 6 might assume the role of the original residue 

15 Y51 of the binding site and might stabilize the carboxylate group 
of the substrate by forming a hydrogen bridge - The distance 
between the hydroxyl group of T26 and the amino group of K188 is 
6-9 A. Compared with the original binding site, this distance is 
approximately 2 A greater (4.8 A between the hydroxyl group of 

20 Y51 and the guanidinium group of R47). It is therefore possible 
that the Kl88-containing F helix is subjected to further 
conformational alterations, owing to which 8-pNCA is retained 
deeper down in the binding region, and the distance between T26 
and K188 is reduced. 

25 

5. Mutant LARVF (F87V, L188K, A74G, R47F, V26T) 

It was possible to demonstrate that position 87 is of particular 
importance for the catalytic activity and the substrate 

30 specificity of the enzyme (3, 13). This is why mutant LARV was 
optimized in position 87 by site-specific randomized mutagenesis - 
Amongst 100 colonies, one mutant was obtained which exhibited a 
catalytic efficiency of 3.5*10^ s-^ M-^ for 8-pNCA. The DNA 
sequence data revealed that alanine in position 87 was replaced 

35 by valine. 

It is known from the crystal structure (11) and earlier 
experiments (3, 13) that R87 is important for the access of the 
substrate to the heme iron. The bulky para-nitrophenyl group of 
40 this substance made it necessary to increase the size of the 
binding site by exchanging phenylalanine for alanine. By 
exchanging A87 for valine, the contact of C^^ with the heme iron 
atom is improved, owing to steric interactions between the ether 
oxygen and the V87 side chain. 

45 
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The data reveal that two key steps, F87A to L and LARV to LARVF, 
increased the catalytic efficiency for 8-pNCA. This raised the 
question as to whether the other mutations in mutant LARVF can be 
reversed without losing the activity for 8-pNCA. Mutants were 
5 therefore generated which contain mutation F87V instead of 
mutation F87A. These mutants have the following names: 

1. ) L7V with the mutations F87V L188K 

2. ) AL7V with the mutations F87V L188K A74G 

10 3.) ARL7V with the mutations F87V L188K A74G R47F 

The following results were obtained in the activity assays. 

Table 3: Kinetic parameters of P450 BM-3 mutants for pNCA 
15 derivatives of different chain lengths, determined at pH 8.0 and 
25C 



25 





Kcat (s-i) 


Km {\m) 


Kcat/Km (s-iM-i) 




12 


10 


8 


12 


10 


8 


12 


10 


8 


F87V 


0.7 


1.8 




3.8 


8.5 




1.7-105 


2-1-105 




L7V 


2.8 


4.7 




6.4 


14.2 




4.3-105 


3.3-105 




AL7V 


2-2 


5.9 


4.3 


15. 1 


22.7 


197.6 


1-4-105 


2-6-105 


2.0-104 


ARL7V 


1.4 


5.5 


3.9 


8.9 


17.5 


41.4 


1.7-105 


3.1-105 


9.3-104 


ARVLF 


1.4 


7.2 


0.2 


12.3 


44.8 


6.5 


9-2-104 


1-6-105 


3.5-104 



The results show that mutant F87V and L7V show no measurable 
conversion for the substrate 8-pNCA. The results furthermore show 
that the quadruple mutant ARL7V shows an even better catalytic 
efficiency than the quintuple mutant ARVLF. 



30 

Moreover, mutants ARL7V and ARVLF show high activities for capric 
acid (CIO), which is shorter by two carbon atoms than the 
shortest native fatty acid substrate of the wild-type enzyme. 
Moreover, co-4~monohydroxylated products were observed when lauric 
35 acid (C12) is used as substrate, while the wild-type enzyme of 
P450 BM-3 is only active in the positions co-1, co-2 and co-3. 

Example 4; Determination of the preferred hydroxylation position 
for carboxylic acids of different chain lengths and various 
40 mutants, and comparison with the wild-type enzyme 

The reaction batches are worked up and analyzed in accordance 
with the above -de scribed method (6). 



45 
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The results are compiled in Table 4 below. 



Table 4 : 



5 


Hydroxylation 


Capric acid (CIO) 


Laurie acid (C12) 




positions 


[%] 


r Q, 1 




Wild type 








co-3 






10 


co-2 








co-l 




38 




Mutant L7V 








co-3 




53 


15 


(jd-2 




30 




co-l 




17 




Mutant F87V 








co-3 




51 




co-2 




25 


20 








co-l 




24 




Mutant AIiTV 








co-3 




28 




CO — z 






25 


co-l 




18 




Mutant ARL7V 








CO-3 


14 


35 




co-2 


33 


50 


30 


co-l 


53 


15 




Mutant ARVLF 








CO-3 


15 


34 




co-2 


30 


53 


35 


co-l 


55 


13 



What is noticeable in the hydroxylation of lauric acid is that a 
change in regioselectivity first takes place during the 
transition from the wild type to P450 BM-3 F87V, which 
40 preferentially catalyzes an co-3 hydroxylation, as is the case with 
P450 BM-3 L7V. During the transition to the triple mutant, the 
regioselectivity changes again; P450 BM-3 ALTV, P450 BM-3 ARL7V 
and P450 BM-3 ARVLF preferentially direct the hydroxylation to 
the co-2 position. 
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All GC analyses demonstrate that capric acid is only converted by 
P450 BM-3 ARVLF and P450 BM-3 ARL7V, while a conversion of less 
than 1% is found in the other mutants. Both enzymes exhibit a 
virtually identical regioselectivity , with the co-l position being 
5 preferred. 

To determine the reaction yields, gas chromatograms of standards 
were recorded. The starting material standard used was capric or 
lauric acid solution with dichloromethane as solvent 

10 (concentration 0.5 mg/ml) . Owing to the FID detection method, it 
was not possible to simply compare the peak areas of starting 
materials and products with one another since molecules which 
have different structural and empirical formulae generate 
different ion fluxes upon combustion. These ion fluxes are not 

15 proportional to the quantitative ratio of starting material and 
product. The product standard used was therefore commercially 
available 10-hydroxycapric acid or 12-lauric acid. The empirical 
formulae of the standards and of the products are identical, and 
the structure only differs with regard to the position of the 

20 hydroxyl group, which is why approximately identical detectable 
ion fluxes can be assumed for identical quantities - 

In the hydroxylations of capric acid with P450 BM-3 ARVLF and 
P450 BM-3 ARL7V catalysis, the cumulative product yields were 57% 
25 and 38%, respectively. In the hydroxylations of lauric acid, the 
yield was 51% for P450 BM-3 ARVLF catalysis and between 38% and 
40% for all other mutants and for the wild type- 

30 



35 



40 



45 



0050/51241 

27 

References 

1. Boddupalli, S. S., et al., (1990) J. Biol. Cham. 265, 
4233-4239 

5 

2. Capdevila, J. H., et al., (1996) J. Biol- Chem. 271, 
22663-22671 

3. Graham-Lorence, et al., (1997) J. Biol. Chem. 272, 
10 1127-1135 

4. Cleland, J. L. and Craik, C. S., Eds. (1996) Protein 
Engineering: Principles and Practice, Wiley-Liss, New York 

15 5. Kuchner, O. and Arnold, F. H-, (1997) Trends Biotechnol. 15, 
523-530 

6. Stemmer, W. P.C. (1994) Nature (London) 370, 389-391 
20 7. Bornscheuer, U. T., (1998) Angew. Chem- 110, 3285-3288 

8. Bernstein, F. C, et al.,( 1977) J. Mol. Biol. 112, 525-542 

9. lii, H. and Poulos, T. L. (1997) Nat. Structural Biol. 4, 
25 140-146 

10. Modi, S., et al.,(1996) Nat. Structural Biol. 3, 414-417 

11. Ravichandran, K. G., et al-, (1993) Science 261, 731-736 

30 

12. Schwaneberg, U., et al., (1999) Anal. Biochem. 269, 359-366 

13. Oliver, C.F., et al-,(1997) Biochemistry, 36, 1567-1572 

35 14. Schwaneberg, U., et al., J. Chromatography A, in press 

15. Cherry, J. R. , et al . , (1999) Nature Biotechnology, 17, 
379-384 

40 



45 



0050/51241 

28 



SEQUENCE LISTING 

<110> BASF Aktiengesellschaf t 

<12 0> Modified cytochrome P4 50 monooxygenases 

<130> M/40434 

<140> 
<141> 

<160> 20 

<170> Pa-tentin Ver. 2,1 

<210> 1 
<211> 3150 
<212> DNA 

<213> Bacillus megaterium 

<220> 

<221> CDS 

<222> (4). .(3150) 

<400> 1 

atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 4 8 

Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
15 10 15 

aat tta ccg tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa 96 
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 25 30 

att gcg gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt 14 4 
lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 

gta acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tge gat 192 
Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 

gaa tea cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 24 0 
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 

gat ttt gca gga gae ggg tta ttt aca age tgg acg eat gaa aaa aat 288 
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
80 85 90 95 

tgg aaa aaa gcg cat aat ate tta ctt cca age ttc agt cag cag gca 336 
Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 

atg aaa ggc tat eat gcg atg atg gte gat ate gee gtg cag ctt gtt 384 
Met Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val 
115 120 125 

caa aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa 432 
Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu 
130 135 140 

gac atg aca cgt tta acg ctt gat aca att ggt ctt tge ggc ttt aac 480 
Asp Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn 
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145 150 155 

tat cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca 52 8 
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 
160 165 170 175 

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 57 6 
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 

aat cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa 624 
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

gat ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc 67 2 
Asp lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg 
210 215 220 

aaa gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac 720 
Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 768 
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn lie Arg 
240 245 250 255 

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816 
Tyr Gin lie lie Thr Phe Leu lie Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 

ctt tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864 
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 

caa aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct gtt cca 912 
Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 
290 295 300 

age tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 960 
Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca 1008 
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 
320 325 330 335 

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056 
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 
340 345 350 

gaa eta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 1104 
Glu Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp 
355 360 365 

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 1152 
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 1200 
Ala He Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 
385 390 395 

tgt ate ggt cag cag ttc get ctt cat gaa gca acg ctg gta ctt ggt 1248 
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Cys 
400 


lie 


Gly 


Gin 


Gin 
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TV T -» 
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Glu 
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Met 
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ttt 
pne 
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VjXU 


gat 
Asp 
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cat 
His 


aca 
Thr 


aac 
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tac 
Tyr 


gag 
Glu 
430 


ctg 
Leu 


1296 


gat 
Asp 


att 
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Lys 


gaa 
Glu 
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Thr 
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Lys 
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gaa 
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Val 
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Lys 


1344 
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Ala 
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Lys 
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Ser 
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aaa 
Lys 


aaa 
Lys 
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1440 


acg 
Thr 
480 
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Pro 


ctg 
Leu 


ctt 
Leu 
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Val 
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Leu 
485 
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Tyr 


ggt 
Gly 
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aat 
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Leu 
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age 
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Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

ctt caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa 2064 
Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 
675 680 685 

ctt cca aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att 2112 
Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie 
690 695 700 

cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 2160 
Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 

eta gat gca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta 2208 
Leu Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu 
720 725 730 735 

get eat ttg cca etc get aaa aca gta tee gta gaa gag ctt ctg caa 2256 
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 2304 
Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

get get aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gee ttg 2352 
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

ctt gaa aag caa gee tac aaa gaa caa gtg ctg gca aaa cgt tta aca 24 00 
Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 

atg ctt gaa ctg ctt gaa aaa tac ccg geg tgt gaa atg aaa ttc age 2448 
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
800 805 810 815 

gaa ttt ate gee ctt ctg cca age ata cgc ccg cgc tat tac teg att 2496 
Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 
820 825 830 

tct tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age 2544 
Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 

gtt gtc tea gga gaa geg tgg age gga tat gga gaa tat aaa gga att 2592 
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 

geg teg aac tat ctt gee gag ctg caa gaa gga gat acg att acg tgc 2640 
Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 

ttt att tec aca ccg cag tea gaa ttt acg ctg cca aaa gae cct gaa 2688 
Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
880 885 890 895 

acg ccg ctt ate atg gtc gga ccg gga aca ggc gtc geg ccg ttt aga 2736 
Thr Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 

ggc ttt gtg cag geg cgc aaa cag eta aaa gaa caa gga cag tea ctt 2 7 84 
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Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

gga gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat 2832 
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

ctg tat caa gaa gag ctt gaa aac gcc caa age gaa ggc ate att acg 2 880 
Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr 

945 950 955 

ctt cat acc get ttt tct cgc atg cea aat cag ccg aaa aca tac gtt 2928 
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
960 965 970 975 

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2976 
Gin His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp 
980 985 990 

caa gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct 3024 
Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

gee gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 3072 
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

agt gaa gca gac get cgc tta tgg ctg cag cag eta gaa gaa aaa ggc 3120 
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 

ega tac gca aaa gac gtg tgg get ggg taa 3150 
Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 



<210> 2 
<211> 1048 
<212> PRT 

<213> Bacillus megaterium 
<400> 2 

Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
15 10 15 

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys lie 
20 25 30 

Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg Val 
35 40 45 

Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp Glu 
50 55 60 

Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp 
65 70 75 80 

Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp 
85 90 95 

Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 
100 105 110 



Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val Gin 
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115 120 125 

Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu Asp 
130 135 140 

Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn Tyr 
145 150 155 160 

Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr Ser 

165 170 175 

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn 
180 185 190 

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 
195 200 205 

lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg Lys 
210 215 220 

Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225 230 235 240 

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg Tyr 
245 250 255 

Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 
260 265 270 

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 280 285 

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
290 295 300 

Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305 310 315 320 

Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 
325 330 335 

Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 
340 345 350 

Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp Gly 

355 360 365 

Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
370 375 380 

He Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala Cys 
385 390 395 400 

He Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 
405 410 415 

Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 
420 425 430 

He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
435 440 445 

Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu 
450 455 460 
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Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465 470 475 480 

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 
485 490 495 

Ala Arg Asp Leu Ala Asp lie Ala Met Ser Lys Gly Phe Ala Pro Gin 
500 505 510 

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
515 520 525 

Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
530 535 540 

Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val Lys 

545 550 555 560 

Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 
565 570 575 

Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala Lys 
580 585 590 

Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
595 600 605 

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 615 620 

Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys Ser 
625 630 635 640 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 
645 650 655 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 
660 665 670 

Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu Leu 
675 680 685 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro 
690 695 700 

Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly Leu 
705 710 715 720 

Asp Ala Ser Gin Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 
725 730 735 

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 
740 745 750 

Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 
755 760 765 

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
770 775 780 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 800 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 
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805 810 815 

Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie Ser 
820 825 830 

Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser Val 
835 840 845 

Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie Ala 
850 855 860 

Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys Phe 
865 870 875 880 

lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 
885 890 895 

Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 
900 905 910 

Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 
915 920 925 

Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr Leu 
945 950 955 960 

His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 
965 970 975 

His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp Gin 
980 985 990 

Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 1000 1005 

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val Ser 
1010 1015 1020 

Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly Arg 
1025 1030 1035 1040 

Tyr Ala Lys Asp Val Trp Ala Gly 
1045 



<210> 3 
<2H> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 3 

gcaggagacg ggttgnnnac aagctggacg 



<210> 4 
<211> 30 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 4 

cgtccagctt gtnnncaacc cgtctcctgc 

<210> 5 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 

<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag 

<210> 6 
<211> 34 
<212> DNA 

<213> Artificial sequence 

<220> 

<2 23> Description of the artificial sequence: PGR primer 
<400> 6 

ctggatttgc tcgctgnnnc ttgttcattg cttc 

<210> 7 
<211> 41 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 

<210> 8 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 8 

cgtacaaatt taagnnnttg acttaagttt ttatcaaagc 

<210> 9 
<211> 37 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PGR primer 
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<400> 9 

gttattaaac acagataaan nngttcaagc tttgatg 

<210> 10 
<211> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<2 23> Description of the artificial sequence: PGR primer 
<400> 10 

catcaaagct tgaacnnntt tatctgtgtt taataac 

<210> 11 
<211> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<2 23> Description of the artificial sequence: PGR primer 
<400> 11 

gttattaaac acagataaac cgnnncaagc tttgatg 

<210> 12 
<211> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 12 

catcaaagct tgnnncggtt tatctgtgtt taataac 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 13 

cgaggcgcct ggtnnngtaa cgcgctactt ate 

<210> 14 
<211> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR-primer 
<400> 14 

gataagtagc gcgttacnnn accaggcgcc teg 



<210> 15 
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<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Description of the artificial sequence: PGR primer 
<400> 15 

cctggtcgtg taacgcgcnn nttatcaagt cage 34 

<210> 16 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 

<400> 16 

gctgacttga taannngcgc gttacacgac cagg 34 

<210> 17 
<211> 40 
<212> DNA 

<213> Artificial sequence 

<220> 

<2 23> Description of the artificial sequence: PGR primer 
<400> 17 

gctttgataa aaacttannn caagcgctta aatttgtacg 4 0 

<210> 18 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 18 

cgtacaaatt taagcgcttg nnntaagttt ttatcaaagc 4 0 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial sequence 

<220> 

<2 23> Description of the artificial sequence: PGR primer 



<400> 19 

ggcgacgaac tannngttct gattcctcag 

<210> 20 
<211> 30 
<212> DNA 

<213> Artificial sequence 



30 



<220> 
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<223> Description of the artificial sequence 
<400> 20 

ctgaggaatc agaacnnnta gttcgtcgcc 
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We claim: 

1. A modified cytochrome P450 monooxygenase which, in comparison 
5 with the wild-type enzyme, shows an altered substrate profile 

in the terminal and/or subterminal enzymatic hydroxylation of 
aliphatic carboxylic acids, owing to site-specific 
mutagenesis of its substrate binding region. 

10 2. A monooxygenase as claimed in claim 1, which is derived from 
cytochrome P450 monooxygenases of bacterial origin, 

3. A monooxygenase as claimed in claim 2, which is derived from 
Bacillus megaterium cytochrome P450 monooxygenase BM-3 with 

15 an amino acid sequence in accordance with SEQ ID NO: 2, which 

has at least one functional mutation in one of the following 
amino acid sequence regions: 24-28, 45-51, 70-72, 73-82, 
86-88, 172-224 und 352-356, with the proviso that, if the 
enzyme carries the mutation F87A, more than one of these 

20 regions is mutated. 

4. A monooxygenase as claimed in claim 3, which comprises at 
least one functional mutation in the amino acid sequence 
regions 86-88 and 172-224. 



25 



30 



A monooxygenase as claimed in claim 4, which comprises at 
least one of the following cunino acid substitution patterns : 



a) 


F87V; 










b) 


F87A 


L188K; 








c) 


F87V 


L188K; 








d) 


F87A 


L188K 


A7 4G; 






e) 


F87V 


L188K 


A74a; 






f ) 


F87A 


L188K 


A74G 


R4 7F; 




g) 


F87V 


L188K 


A74G 


R4 7F; 




h) 


F87A 


L188K 


A74G 


R4 7F 


V26T; or 


i) 


F87V 


L188K 


A74G 


R47F 


V26T; 



35 



and functional equivalents thereof. 

40 

6. A monooxygenase as claimed in claim 3, which comprises a 

single amino acid substitution from amongst the following: 

a) V26T, 

45 b) R47F, 

c) S72G, 

d) A74G, 
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e) F87V, 

f) Ll88z, where Z is an amino acid selected from amongst K, 
R, W, Q, N, G, A and and 

g) M354T; 

5 

and functional equivalents thereof. 

7. A nucleic acid sequence encoding a monooxygenase as claimed 
in any of the preceding claims and the complementary nucleic 

10 acid sequence thereof. 

8. An expression construct comprising, under the genetic control 
of regulatory nucleic acid sequences, an encoding sequence 
which encompasses a nucleic acid sequence as claimed in claim 

15 7. 

9. A vector which encompasses at least one expression construct 
as claimed in claim 8. 

20 10. A recombinant microorganism which has been transformed with 
at least one vector as claimed in claim 9. 

11- A microorganism as claimed in claim 10, selected from amongst 
bacteria of the genus Escherichia. 

25 

12. A process for the enzymatic production of terminally or 

subterminally hydroxylated aliphatic carboxylic acids, which 
comprises 

30 al ) culturing a recombinant microorganism as claimed in claim 

10 or 11 according to the invention in the presence of a 
culture medium which contains at least one hydroxylatable 
carboxylic acid or at least one hydroxylatable carboxylic 
acid derivative; or 



35 



40 



a2 ) incubating a reaction medium containing at least one 
hydroxylatable carboxylic acid or at least one 
hydroxylatable carboxylic acid derivative with an enzyme 
as claimed in any of claims 1 to 6, and 

b) isolating the resulting hydroxylated product from the 
medium. 



45 



13. 



A method as claimed in claim 12, wherein the hydroxylatable 
carboxylic acid is a C8-C30 monocar boxy lie acid or a 
derivative thereof . 
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14. A method as claimed in claim 13, wherein the hydroxylatable 
carboxylic acid is a C8-Ci2-inonocarboxylic acid or a 
derivative thereof and the monooxygenase used is a mutant as 
claimed in claim 5 . 

5 

15. A method as claimed in claim 13, wherein the hydroxylatable 
carboxylic acid is a Ci2-C3o-monocarboxylic acid or a 
derivative thereof and the monooxygenase employed is a mutant 
selected from amongst the single mutants F87A, F87V, V26T, 

10 S72G, A74G and M354T, and the multiple mutants 

F87A L188K A74G R47F; 
F87V L188K A74G R47F; 
F87A L188K A74G R47F V26T; or 
F87V L188K A74G R47F V26T. 

15 

16. A method as claimed in any of claims 12 to 15, wherein the 
reaction is carried out in the presence of an electron donor 
or a reduction equivalent. 

20 17. A method as claimed in claim 16, wherein the electron donor 
or the reduction equivalent is selected from amongst NADH, 
NADPH and Zn/Co(III) sepulchrate. 
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Abstract 

The present invention relates to modified cytochrome P450 
5 monooxygenases with an altered substrate profile, to nucleic acid 
sequences coding therefor, to expression constructs and vectors, 
to recombinant microorganisms which comprise these vectors, and 
to processes for the microbiological production of terminally or 
subterminally hydroxylated aliphatic carboxylic acids. 
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We (I) beheve that we are (I am) the original, first, and joint (sole) inventor(s) of the subject matter which is claimed and 
for which a patent is sought on the invention entitled 

Modified cytochrome P450 monooxygenases 

the specification of which 
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[] was filed on as 

AppUcation Serial No. 
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[x] was filed as PCT international application 

Number PCT/EP0Q/Q7252 

on July 27, 2000 

and was amended under PCT Article 19 

on (if applicable). 

We (I) hereby state that we (I) have reviewed and understand the contents of the above— identified specification, including 
the claims, as amended by any amendment referred to above. 

We (I) acknowledge the duty to disclose information known to be material to the patentability of this application as 
defined in Section 1.56 of Title 37 Code of Federal Regulations. 

We (I) hereby clahn foreign priority benefits under 35 U.S.C. § 1 19(a>-(d) or § 365(b) of any foreign application(s) for 
patent or inventor's certificate, or § 365(a) of any PCT International application which designated at least one country other 
than the United States, listed below and have also identified below, by checking the box, any foreign application for patent 
or inventor's certificate, or PCT International application having a filing date before that of the application on which priority 
is claimed. Prior Foreign AppIication(s) 
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10011723.6 Germany 10 March 2000 [x] Yes [ ] No 
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We (I) hereby claim the benefit under Title 35, United States Codes, § 119(e) of any United States provisional 
application(s) listed below. 



(Application Number) (Filing Date) 



(Application Number) (Filing Date) 

We (I) hereby claim the benefit under 35 U.S.C. § 120 of any United States application(s), or § 365(c) of any PCX 
International application designating the United States, listed below and, insofar as the subject matter of each of the claims 
of this application is not disclosed in the prior United States or PCX International application in the manner provided by the 
first paragraph of 35 U.S.C. §112,1 acknowledge the duty to disclose information which is material to patentability as defined 
in 37 CFR § 1 .56 which became available between the filing date of the prior apphcation and the national or PCX International 
filing date of this application. 



AppUcation Serial No, Filing Date Status (pending, patented, 

^ abandoned) 



And we (I) hereby appoint Messrs. HERBERT. B. KEIL, Registration Numberl8^962i^ RUSSEL E. WEINKAUF, 

Registration NumberJ_8,495: the address of both being Messrs jCeil & Weinkauf. 1101 Connecticut Ave., KW .,Ji\ashington, 
D.C^^0g2fij;telephone 202-659-0100), our attorneys, with full power of substitution and revocation, to prosecute this 
"application, to make alterations and amendments therein, to sign the drawings, to receive the patent, and to transact all 
business in the Patent Office connected therewith. 

We (I) declare that all statements made herein of our (my) own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that willful 
false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the application or any patent issuing 
thereon. 
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SEQUENCE LISTING 



<110> Hauer, Bernhard 
Pleiss, Jurgen 
Schwaneberg, Ulrich 
Schmitt, Jutta 

<12 0> Modified cytochrome P450 monooxygenases 

<130> M/40434 

<140> US 10/031,695 

<150> PCT/EPOO/07252 
<151> 2000-07-27 

<160> 20 

<170> Patentin Ver. 2.1 



<210> 1 
<211> 3150 
<212> DNA 

<213> Bacillus megaterium 



<220> 
<221> CDS 
<222> (4) , 



(3150) 



<400> 1 



atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 
Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
15 10 15 



48 



aat tta ccg tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa 
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 25 30 



96 



att gcg gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt 
lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 



144 



gta acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tge gat 
Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 



192 



gaa tea ege ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 



240 



gat ttt gca gga gac ggg tta ttt aca age tgg acg eat gaa aaa aat 
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
80 85 90 95 



288 



tgg aaa aaa gcg cat aat ate tta ctt cca age ttc agt cag cag gca 
Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 



336 



2 



atg aaa ggc tat cat gcg atg atg gtc gat ate gcc gtg cag ctt gtt 3 84 
Met Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val 
115 120 125 

caa aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa 432 
Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu 
130 135 140 

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480 
Asp Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn 
145 150 155 

tat cgc ttt aac age ttt tac ega gat cag cct cat cca ttt att aca 528 
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 
160 165 170 175 

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576 
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 

aat cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa 624 
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

gat ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc 672 
Asp lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg 
210 215 220 

aaa gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac 72 0 
Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 

225 230 235 

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 768 
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg 
240 245 250 255 

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816 
Tyr Gin He lie Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 

ctt tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864 
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 

caa aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct gtt cca 912 
Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 
290 295 300 

age tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 960 
Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tee eta tat gca 1008 
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 

320 325 330 335 

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056 
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 
340 345 350 
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gaa eta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 1104 
Glu Leu Met Val Leu lie Pro Gin Leu His Arg Asp Lys Thr lie Trp 
355 360 365 

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 1152 
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 1200 
Ala lie Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 
385 390 395 

tgt ate ggt cag cag ttc get ctt cat gaa gca acg ctg gta ctt ggt 1248 
Cys lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
400 405 410 415 

atg atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 1296 
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 
420 425 430 

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 1344 
Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
435 440 445 

gca aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act 13 92 
Ala Lys Ser Lys Lys lie Pro Leu Gly Gly lie Pro Ser Pro Ser Thr 
450 455 460 

gaa cag tct get aaa aaa gta cgc aaa aag gca gaa aac get cat aat 144 0 
Glu Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 

465 470 475 

acg ccg ctg ctt gtg eta tac ggt tea aat atg gga aca get gaa gga 1488 
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
480 485 490 495 

acg gcg cgt gat tta gca gat att gca atg age aaa gga ttt gca ccg 1536 
Thr Ala Arg Asp Leu Ala Asp lie Ala Met Ser Lys Gly Phe Ala Pro 
500 505 510 

eag gtc gca acg ctt gat tea cac gee gga aat ctt ccg cgc gaa gga 1584 
Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 

get gta tta att gta acg gcg tct tat aac ggt eat ccg cct gat aac 1632 
Ala Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

gca aag caa ttt gtc gac tgg tta gac caa gcg tct get gat gaa gta 1680 
Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 

aaa ggc gtt cgc tac tec gta ttt gga tge ggc gat aaa aac tgg get 172 8 
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
560 565 570 575 

act acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gee get 1776 
Thr Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala 
580 585 590 
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aaa ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac 1824 
Lys Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 1872 
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

gta gca gcc tac ttt aac etc gac att gaa aac agt gaa gat aat aaa 192 0 
Val Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys 
625 630 635 

tct act ctt tea ett caa ttt gtc gac age gee geg gat atg ecg ctt 1968 
Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
640 645 650 655 

gcg aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa 2016 
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

ctt caa cag cca ggc agt gca cga age acg ega cat ctt gaa att gaa 2064 
Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 
675 680 685 

ctt cca aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att 2112 
Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie 

690 695 700 

cet cgc aac tat gaa gga ata gta aac cgt gta aca gca agg tte ggc 2160 
Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 

eta gat gca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta 22 08 
Leu Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu 
720 725 730 735 

get cat ttg cca etc get aaa aca gta tec gta gaa gag ett ctg caa 2256 
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 2304 
Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

get get aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg 2352 
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

ett gaa aag caa gee tac aaa gaa caa gtg ctg gca aaa cgt tta aca 2400 
Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 

atg ctt gaa ctg ctt gaa aaa tac ccg geg tgt gaa atg aaa ttc age 2448 
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
800 805 810 815 

gaa ttt ate gee ett ctg cca age ata cgc ccg cgc tat tac teg att 2496 
Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 
820 825 830 
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tct tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age 2544 
Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 

gtt gtc tea gga gaa gcg tgg age gga tat gga gaa tat aaa gga att 2592 
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 

gcg teg aac tat ctt gee gag ctg caa gaa gga gat aeg att acg tgc 2 64 0 
Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 

ttt att tec aca ccg eag tea gaa ttt acg ctg eea aaa gae cct gaa 2688 
Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
880 885 890 895 

acg ccg ctt ate atg gtc gga ccg gga aca gge gtc gcg ccg ttt aga 2736 
Thr Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 

gge ttt gtg eag gcg cgc aaa eag eta aaa gaa caa gga cag tea ctt 2 784 
Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

gga gaa gca cat tta tac ttc gge tgc cgt tea cct eat gaa gae tat 2832 
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 

930 935 940 

Ctg tat caa gaa gag ctt gaa aac gee caa age gaa gge ate att aeg 2880 
Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie He Thr 
945 950 955 

ctt cat ace get ttt tct ege atg eea aat cag ccg aaa aca tac gtt 2 928 
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
960 965 970 975 

cag cac gta atg gaa caa gae gge aag aaa ttg att gaa ctt ett gat 2976 
Gin His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp 
980 985 990 

caa gga gcg cac ttc tat att tgc gga gae gga age caa atg gca cct 3024 
Gin Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 



gee gtt gaa gca aeg ett atg aaa age tat get gae gtt cac caa gtg 
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 



3072 



agt gaa gca gae get cgc tta tgg. ctg cag eag eta gaa gaa aaa gge 3120 
Ser Glu Ala Asp Ala Arg Leu Trp ieu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 • 1035 

ega tac gca aaa gae gtg tgg get ggg taa 3150 
Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 



<210> 2 
<211> 1048 



<212> PRT 

<213> Bacillus Tnegaterium 



<400> 2 

Thr lie Lys Glu 
1 

Leu Pro Leu Leu 
20 

Ala Asp Glu Leu 
35 

Thr Arg Tyr Leu 
50 

Ser Arg Phe Asp 
65 

Phe Ala Gly Asp 



Lys Lys Ala His 
100 

Lys Gly Tyr His 
115 

Lys Trp Glu Arg 
130 

Met Thr Arg Leu 
145 

Arg Phe Asn Ser 



Met Val Arg Ala 
180 

Pro Asp Asp Pro 
195 

lie Lys Val Met 

210 

Ala Ser Gly Glu 
225 

Lys Asp Pro Glu 



Gin He He Thr 
260 

Leu Ser Phe Ala 
275 



Met Pro Gin Pro 

5 

Asn Thr Asp Lys 



Gly Glu He Phe 
40 

Ser Ser Gin Arg 

55 

Lys Asn Leu Ser 
70 

Gly Leu Phe Thr 

85 

Asn He Leu Leu 



Ala Met Met Val 
120 

Leu Asn Ala Asp 

135 

Thr Leu Asp Thr 
150 

Phe Tyr Arg Asp 
165 

Leu Asp Glu Ala 



Ala Tyr Asp Glu 
200 

Asn Asp Leu Val 

215 

Gin Ser Asp Asp 
230 

Thr Gly Glu Pro 
245 

Phe Leu He Ala 



Leu Tyr Phe Leu 
280 



Lys Thr Phe Gly 
10 

Pro Val Gin Ala 
25 

Lys Phe Glu Ala 



Leu He Lys Glu 

60 

Gin Ala Leu Lys 
75 

Ser Trp Thr His 

90 

Pro Ser Phe Ser 
105 

Asp He Ala Val 



Glu His He Glu 
140 

He Gly Leu Cys 
155 

Gin Pro His Pro 
170 

Met Asn Lys Leu 
185 

Asn Lys Arg Gin 



Asp Lys He He 
220 

Leu Leu Thr His 
235 

Leu Asp Asp Glu 
250 

Gly His Glu Thr 
265 

Val Lys Asn Pro 



Glu Leu Lys Asn 

15 

Leu Met Lys He 
30 

Pro Gly Arg Val 
45 

Ala Cys Asp Glu 



Phe Val Arg Asp 
80 

Glu Lys Asn Trp 
95 

Gin Gin Ala Met 
110 

Gin Leu Val Gin 
125 

Val Pro Glu Asp 



Gly Phe Asn Tyr 
160 

Phe He Thr Ser 
175 

Gin Arg Ala Asn 
190 

Phe Gin Glu Asp 
205 

Ala Asp Arg Lys 



Met Leu Asn Gly 
240 

Asn He Arg Tyr 
255 

Thr Ser Gly Leu 
270 

His Val Leu Gin 
285 
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Lys Ala Ala Glu 
290 

Tyr Lys Gin Val 
305 

Ala Leu Arg Leu 



Glu Asp Thr Val 
340 

Leu Met Val Leu 
355 

Asp Asp Val Glu 
370 

lie Pro Gin His 

385 

He Gly Gin Gin 



Met Leu Lys His 
420 

He Lys Glu Thr 
435 

Lys Ser Lys Lys 
450 

Gin Ser Ala Lys 
465 

Pro Leu Leu Val 



Ala Arg Asp Leu 
500 

Val Ala Thr Leu 

515 

Val Leu He Val 
530 

Lys Gin Phe Val 

545 

Gly Val Arg Tyr 



Thr Tyr Gin Lys 
580 

Gly Ala Glu Asn 
595 



Glu Ala Ala Arg 
295 

Lys Gin Leu Lys 
310 

Trp Pro Thr Ala 
325 

Leu Gly Gly Glu 



He Pro Gin Leu 
360 

Glu Phe Arg Pro 
375 

Ala Phe Lys Pro 

390 

Phe Ala Leu His 
405 

Phe Asp Phe Glu 



Leu Thr Leu Lys 
440 

He Pro Leu Gly 
455 

Lys Val Arg Lys 
470 

Leu Tyr Gly Ser 
485 

Ala Asp He Ala 



Asp Ser His Ala 
520 

Thr Ala Ser Tyr 
535 

Asp Trp Leu Asp 
550 

Ser Val Phe Gly 
565 

Val Pro Ala Phe 



He Ala Asp Arg 
600 



Val Leu Val Asp 
300 

Tyr Val Gly Met 
315 

Pro Ala Phe Ser 
330 

Tyr Pro Leu Glu 
345 

His Arg Asp Lys 



Glu Arg Phe Glu 
380 

Phe Gly Asn Gly 

395 

Glu Ala Thr Leu 
410 

Asp His Thr Asn 
425 

Pro Glu Gly Phe 



Gly He Pro Ser 
460 

Lys Ala Glu Asn 
475 

Asn Met Gly Thr 
490 

Met Ser Lys Gly 
505 

Gly Asn Leu Pro 



Asn Gly His Pro 
540 

Gin Ala Ser Ala 
555 

Cys Gly Asp Lys 
570 

He Asp Glu Thr 
585 

Gly Glu Ala Asp 



Pro Val Pro Ser 



Val Leu Asn Glu 

320 

Leu Tyr Ala Lys 
335 

Lys Gly Asp Glu 
350 

Thr He Trp Gly 

365 

Asn Pro Ser Ala 



Gin Arg Ala Cys 
400 

Val Leu Gly Met 
415 

Tyr Glu Leu Asp 
430 

Val Val Lys Ala 

445 

Pro Ser Thr Glu 



Ala His Asn Thr 
480 

Ala Glu Gly Thr 
495 

Phe Ala Pro Gin 
510 

Arg Glu Gly Ala 

525 

Pro Asp Asn Ala 



Asp Glu Val Lys 
560 

Asn Trp Ala Thr 

575 

Leu Ala Ala Lys 
590 

Ala Ser Asp Asp 
605 
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Phe Glu Gly Thr 
610 

Ala Ala Tyr Phe 
625 

Thr Leu Ser Leu 



Lys Met His Gly 
660 

Gin Gin Pro Gly 
675 

Pro Lys Glu Ala 
690 

Arg Asn Tyr Glu 
705 

Asp Ala Ser Gin 



His Leu Pro Leu 
740 

Val Glu Leu Gin 
755 

Ala Lys Thr Val 
770 

Glu Lys Gin Ala 
785 

Leu Glu Leu Leu 



Phe lie Ala Leu 
820 

Ser Ser Pro Arg 
835 

Val Ser Gly Glu 
850 

Ser Asn Tyr Leu 
865 

lie Ser Thr Pro 



Pro Leu lie Met 
900 

Phe Val Gin Ala 
915 



Tyr Glu Glu Trp 
615 

Asn Leu Asp lie 

630 

Gin Phe Val Asp 
645 

Ala Phe Ser Thr 



Ser Ala Arg Ser 
680 

Ser Tyr Gin Glu 
695 

Gly lie Val Asn 

710 

Gin lie Arg Leu 
725 

Ala Lys Thr Val 



Asp Pro Val Thr 
760 

Cys Pro Pro His 
775 

Tyr Lys Glu Gin 
790 

Glu Lys Tyr Pro 
805 

Leu Pro Ser lie 



Val Asp Glu Lys 
840 

Ala Trp Ser Gly 
855 

Ala Glu Leu Gin 

870 

Gin Ser Glu Phe 
885 

Val Gly Pro Gly 



Arg Lys Gin Leu 
920 
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Arg Glu His Met 
620 

Glu Asn Ser Glu 
635 

Ser Ala Ala Asp 
650 

Asn Val Val Ala 
665 

Thr Arg His Leu 



Gly Asp His Leu 
700 

Arg Val Thr Ala 

715 

Glu Ala Glu Glu 
730 

Ser Val Glu Glu 
745 

Arg Thr Gin Leu 



Lys Val Glu Leu 
780 

Val Leu Ala Lys 

795 

Ala Cys Glu Met 
810 

Arg Pro Arg Tyr 
825 

Gin Ala Ser lie 



Tyr Gly Glu Tyr 
860 

Glu Gly Asp Thr 
875 

Thr Leu Pro Lys 
890 

Thr Gly Val Ala 
905 

Lys Glu Gin Gly 



Trp Ser Asp Val 



Asp Asn Lys Ser 

640 

Met Pro Leu Ala 
655 

Ser Lys Glu Leu 
670 

Glu lie Glu Leu 
685 

Gly Val lie Pro 



Arg Phe Gly Leu 

720 

Glu Lys Leu Ala 
735 

Leu Leu Gin Tyr 
750 

Arg Ala Met Ala 
765 

Glu Ala Leu Leu 



Arg Leu Thr Met 

800 

Lys Phe Ser Glu 
815 

Tyr Ser lie Ser 
830 

Thr Val Ser Val 

845 

Lys Gly lie Ala 



lie Thr Cys Phe 
880 

Asp Pro Glu Thr 
895 

Pro Phe Arg Gly 
910 

Gin Ser Leu Gly 
925 
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Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr Leu 

945 950 955 960 

His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 
965 970 975 

His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin 
980 985 990 

Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 1000 1005 

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val Ser 
1010 1015 1020 

Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly Arg 
1025 1030 1035 1040 

Tyr Ala Lys Asp Val Trp Ala Gly 
1045 



<210> 3 
<211> 30 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 

<222> 1 . . 30 

<22 3> n is a or g or c or t/u, unknown, or other 



<400> 3 



gcaggagacg ggttgnnnac aagctggacg 



<210> 4 
<211> 30 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . . 30 

<223> n is a or g or c or t/u, unknown, or other 
<400> 4 

cgtccagctt gtnnncaacc cgtctcctgc 
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<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1. .34 

<223> n is a or g or c or t/u, unknown, or other 
<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag 



<210> 6 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . .34 

<223> n is a or g or c or t/u, unknovm, or other 
<400> 6 

ctggatttgc tcgctgnnnc ttgttcattg cttc 



<210> 7 
<211> 41 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1. .41 

<223> n is a or g or c or t/u, unknown, or other 

<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 



<210> 8 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence : PGR primer 



<221> unsure 
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<222> 1. .40 

<223> n is a or g or c or t/u, unknown, or other 



. <400> 8 



cgtacaaatt taagnnnttg acttaagttt ttatcaaagc 



40 



<210> 9 
<211> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . . 37 

<223> n is a or g or c or t/u, unknown, or other 



<210> 10 
<211> 37 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . . 37 

<223> n is a or g or c or t/u, unknown, or other 
<400> 10 

catcaaagct tgaacnnntt tatctgtgtt taataac 37 



<210> 11 

<211> 37 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . .37 

<22 3> n is a or g or c or t/u, unknown, or other 



<400> 



9 



gttattaaac acagataaan nngttcaagc tttgatg 



37 



<400> 11 



gttattaaac acagataaac cgnnncaagc tttgatg 



37 
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<210> 12 
<211> 37 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . .37 

<223> n is a or g or c or t/u, unknown, or other 



<400> 12 



catcaaagct tgnnncggtt tatctgtgtt taataac 



<210> 13 
<211> 33 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1. .33 

<223> n is a or g or c or t/u, unknown, or other 
<400> 13 

cgaggcgcct ggtnnngtaa cgcgctactt ate 



<210> 14 
<211> 33 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PCR-primer 

<221> unsure 
<222> 1 . .33 

<223> n is a or g or c or t/u, unknown, or other 



<400> 14 



gataagtagc gcgttacnnn accaggcgcc teg 

<210> 15 
<211> 34 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 

<222> 1 . . 34 

<223> n is a or g or c or t/u, unknown, or other 
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<400> 15 



cctggtcgtg taacgcgcnn nttatcaagt cage 34 

<210> 16 
<211> 34 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: PCR primer 

<221> unsure 
<222> 1. .34 

<223> n is a or g or c or t/u, unknown, or other 



<400> 16 



gctgacttga taannngcgc gttacacgac cagg 34 

<210> 17 
<211> 40 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PCR primer 

<221> unsure 
<222> 1. .40 

<223> n is a or g or c or t/u, unknown, or other 



<400> 17 

gctttgataa aaacttannn caagcgctta aatttgtacg 40 

<210> 18 
<211> 40 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: PCR primer 

<221> unsure 
<222> 1 . .40 

<223> n is a or g or c or t/u, unknown, or other 

<400> 18 

cgtacaaatt taagcgcttg nnntaagttt ttatcaaagc 40 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PCR primer 
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<221> unsure 
<222> 1. .30 

<223> n is a or g or c or t/u, unknown, or other 
<400> 19 

ggcgacgaac tannngttct gattcctcag 



<210> 20 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 

<221> unsure 
<222> 1 . . 30 

<223> n is a or g or c or t/u, unknown, or other 
<400> 20 

ctgaggaatc agaacnnnta gttcgtcgcc 



